Abstract. We give a general method based on dyadic Calderón-Zygmund
theory to prove sharp one and two-weight norm inequalities for some of
the classical operators of harmonic analysis: the Hilbert and Riesz
transforms, the Beurling-Ahlfors operator, the maximal singular
integrals associated to these operators, the dyadic square function
and the vector-valued maximal operator.

In the one-weight case we prove the sharp dependence on the $A_p$
constant by finding the best value for the exponent $\alpha(p)$ such
that

\[
\|Tf\|_{L^p(w)} \le C_{n,T}\,[w]_{A_p}^{\alpha(p)}\,\|f\|_{L^p(w)}.
\]

For the Hilbert transform, the Riesz transforms and the
Beurling-Ahlfors operator the sharp value of $\alpha(p)$ was found by
Petermichl and Volberg [47, 48, 49]; their proofs used approximations
by the dyadic Haar shift operators, Bellman function techniques, and
two-weight norm inequalities. Our proofs again depend on dyadic
approximation, but avoid Bellman functions and two-weight norm
inequalities. We instead use a recent result due to A. Lerner [34]
to estimate the oscillation of dyadic operators. By applying this
we get a straightforward proof of the sharp dependence on the $A_p$
constant for any operator that can be approximated by Haar shift
operators. In particular, we provide a unified approach for the
Hilbert and Riesz transforms, the Beurling-Ahlfors operator (and
their corresponding maximal singular integrals), dyadic paraproducts
and Haar multipliers. Furthermore, we completely solve the
open problem of sharp dependence for the dyadic square functions
and vector-valued Hardy-Littlewood maximal function.

In the two-weight case we use the very same techniques to prove
sharp results in the scale of $A_p$ bump conditions. For the singular
integrals considered above, we show they map $L^p(v)$ into $L^p(u)$,
$1 < p < \infty$, if the pair $(u, v)$ satisfies

\[
\sup_Q \|u^{1/p}\|_{A,Q}\,\|v^{-1/p}\|_{B,Q} < \infty,
\]

where $\bar{A} \in B_{p'}$ and $\bar{B} \in B_p$ are Orlicz functions.
This condition is sharp. Furthermore, this condition characterizes (in
the scale of these $A_p$ bump conditions) the corresponding two-weight
norm inequality for the Hardy-Littlewood maximal operator $M$ and its
dual: i.e., $M : L^p(v) \to L^p(u)$ and
$M : L^{p'}(u^{1-p'}) \to L^{p'}(v^{1-p'})$.
Muckenhoupt and Wheeden conjectured that these two inequalities
for $M$ are sufficient for the Hilbert transform to be bounded
from $L^p(v)$ into $L^p(u)$. Thus, in the scale of $A_p$ bump
conditions, we prove their conjecture. We prove similar, sharp
two-weight results for the dyadic square function and the
vector-valued maximal operator.

1. Introduction

The problem of proving one and two-weight norm inequalities for
the classical operators of harmonic analysis (singular integrals,
square functions, maximal operators) has a long and complex history.
In the one-weight case, the (nearly) universal sufficient and (often)
necessary condition for an operator to be bounded on $L^p(w)$ is the
$A_p$ condition: given $1 < p < \infty$, a weight $w$ (i.e., a
non-negative, locally integrable function) is in $A_p$ if

\[
[w]_{A_p} = \sup_Q \Big(\fint_Q w(x)\,dx\Big)
            \Big(\fint_Q w(x)^{1-p'}\,dx\Big)^{p-1} < \infty,
\]

where the supremum is taken over all cubes in $\mathbb{R}^n$ and
$\fint_Q w(x)\,dx = |Q|^{-1}\int_Q w(x)\,dx$. For more on one-weight
inequalities we refer the reader to [13, 18, 21].
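
As a quick numerical illustration of the $A_p$ condition (not part of the original argument), the sketch below evaluates the product of averages from the definition of $[w]_{A_p}$ on a single interval by the midpoint rule. The function name `ap_product`, the quadrature rule and the sample weight $w(x) = x$ on $(1,2)$ are assumptions made for illustration; the supremum over all cubes that the definition requires is not attempted.

```python
def ap_product(w, a, b, p, n=2000):
    """Midpoint-rule value of (avg_Q w) * (avg_Q w^{1-p'})^{p-1} on Q = (a, b)."""
    p_conj = p / (p - 1.0)                      # conjugate exponent p'
    h = (b - a) / n
    xs = [a + (i + 0.5) * h for i in range(n)]  # midpoints of the n subintervals
    avg_w = sum(w(x) for x in xs) / n
    avg_dual = sum(w(x) ** (1.0 - p_conj) for x in xs) / n
    return avg_w * avg_dual ** (p - 1.0)

# A constant weight gives 1, which by Hoelder's inequality is the least possible value.
flat = ap_product(lambda x: 3.0, 0.0, 1.0, 2.0)
# For w(x) = x on (1, 2) and p = 2 the product is (3/2) * log 2.
tilted = ap_product(lambda x: x, 1.0, 2.0, 2.0)
```

Since each single-cube product is at least 1, the constant $[w]_{A_p}$ is always at least 1, which is why only the power of $[w]_{A_p}$ matters in the sharp estimates discussed next.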

An important question is to determine the best constant in terms of
the $A_p$ constant $[w]_{A_p}$. More precisely, given an operator $T$,
find the smallest power $\alpha(p)$ such that

\[
\|Tf\|_{L^p(w)} \le C_{n,T}\,[w]_{A_p}^{\alpha(p)}\,\|f\|_{L^p(w)}.
\]

This problem was first investigated by Buckley [3]. More recently, it
has attracted renewed attention because of the work of Astala, Iwaniec
and Saksman [1]. They proved that sharp regularity results for
solutions to the Beltrami equation hold provided that the
Beurling-Ahlfors operator satisfies $\alpha(p) = 1$ for $p \ge 2$.

The problem of characterizing the weights that govern the two-weight
norm inequalities for classical operators is still open and there are
several approaches to finding sufficient conditions on weights for an
operator to be bounded from $L^p(v)$ to $L^p(u)$. One approach is to
replace the two-weight $A_p$ condition with the $A_p$ "bump" condition:

\[
\sup_Q \|u^{1/p}\|_{A,Q}\,\|v^{-1/p}\|_{B,Q} < \infty,
\]

where $A$ and $B$ are Young functions and the norms are localized
Orlicz norms slightly larger than the $L^p$ and $L^{p'}$ norms.
(Precise definitions will be given below.) Sufficient growth conditions
on $A$ and $B$ are known for many operators and this has led to a
number of conjectures on sharp sufficient conditions. For the history
of this approach we refer the reader to [5, 7, 8, 10, 11].

In this paper we develop a unified approach to both of these problems
and the results we get are sharp. We consider one and two-weight
norm inequalities for singular integrals, maximal singular integrals,
the dyadic paraproduct, the dyadic square function and the
vector-valued maximal operator. The results in the one-weight case for
singular integrals are not new, but we believe that our proofs are
simpler than existing proofs. The remaining theorems, however, are all
new.

We believe that our approach shows that there is a deep connec- 
tion between sharp results in the one and two-weight case. Further, 
key to our approach is that the operators are either dyadic or can be 
approximated by dyadic operators (e.g., by the Haar shift operators 
defined below). Thus our results will extend to any operator that can 
be approximated in this way. 

Singular integrals. It is conjectured that if $T$ is any
Calderón-Zygmund singular integral operator, then for any $p$,
$1 < p < \infty$, and for any $w \in A_p$,

\[
\|Tf\|_{L^p(w)} \le C_{T,n,p}\,[w]_{A_p}^{\max(1,\frac{1}{p-1})}\,\|f\|_{L^p(w)}.
\tag{1.1}
\]

This inequality is true if $T$ is the Hilbert transform, a Riesz
transform or the Beurling-Ahlfors operator.

Theorem 1.1. Given $p$, $1 < p < \infty$, if $T$ is the Hilbert
transform, a Riesz transform or the Beurling-Ahlfors operator, then for
all $w \in A_p$ inequality (1.1) holds.

This result was first proved by Petermichl [47, 48] and Petermichl and 
Volberg [49]. For each operator the proof requires several steps. First, 
it is enough to prove the case p = 2; the other values of p follow from 
a version of the Rubio de Francia extrapolation theorem with sharp 
constants due to Dragicevic et al. [12] (Theorem 2.2 below). Second, 
for each of the above operators the problem is reduced to proving the 
weighted inequality for a corresponding dyadic operator by proving 
that the given operator can be approximated by integral averages of 
the dyadic operators (and their analogs defined on translations and 
dilations of the standard dyadic grid). Finally, the desired inequality 
was proved for each of these dyadic operators using Bellman function 
techniques and two-weight norm inequalities. 

Recently, Lacey, Petermichl and Reguera-Rodriguez [28] gave a proof
of the sharp $A_2$ constant for a large family of Haar shift operators
that includes all of the dyadic operators needed for the above results.
Their proof avoids the use of Bellman functions, and instead uses a
deep, two-weight "Tb theorem" for Haar shift operators due to Nazarov,
Treil and Volberg [40].

We give a different and simpler proof that uses approximation by
dyadic Haar shifts but avoids both Bellman functions and two-weight
norm inequalities such as the Tb theorem. Instead, we use a very
interesting decomposition argument based on local mean oscillation
recently developed by Lerner [34] to prove the corresponding result for
dyadic Haar shifts. Intuitively, this decomposition may be thought of
as a version of the Calderón-Zygmund decomposition of a function,
replacing the mean by the median. (We will make this more precise
below.) Theorem 1.1 was announced in [6].

Remark 1.2. After this paper was completed we learned of several other
related results. First, Vagharshakyan [52] has shown that in one
dimension, all convolution-type Calderón-Zygmund singular integral
operators with sufficiently smooth kernel can be approximated by Haar
shifts. Second, Lacey et al. [25] used a deep characterization of the
one-weight problem in [45] to prove Theorem 1.1 for all singular
integrals with sufficiently smooth kernels. Third, Lerner [35] proved
Theorem 1.1 for any convolution-type Calderón-Zygmund singular
integrals provided $p > 3$ or $1 < p < 3/2$. Finally, Hytönen [24]
proved Theorem 1.1 for all singular integrals and all $p > 1$, thus
solving the so-called $A_2$ conjecture. His proof is extremely
technical: it is based on the approach in [45] and a refinement of the
arguments in [28]. A simpler proof of the $A_2$ conjecture based upon
the previous three papers appears in [26].

An important advantage of our approach is that it also yields sharp
two-weight norm inequalities. To state our result we need a few
definitions. A Young function is a function
$A : [0, \infty) \to [0, \infty)$ that is continuous, convex and
strictly increasing, $A(0) = 0$ and $A(t)/t \to \infty$ as
$t \to \infty$. Given a cube $Q$ we define the localized Luxemburg norm
by

\[
\|f\|_{A,Q} = \inf\Big\{\lambda > 0 :
   \fint_Q A\Big(\frac{|f(x)|}{\lambda}\Big)\,dx \le 1\Big\}.
\]

When $A(t) = t^p$, $1 < p < \infty$, we write

\[
\|f\|_{p,Q} = \Big(\fint_Q |f(x)|^p\,dx\Big)^{1/p}.
\]
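
To make the Luxemburg norm concrete, the following sketch computes it by bisection for a function given by finitely many equal-measure samples on $Q$. This is an illustration under stated assumptions, not a construction from the paper: the helper name `luxemburg_norm`, the sampling model and the particular log bump $t^3\log(e+t)$ are all chosen here for the example.

```python
import math

def luxemburg_norm(values, young, rel_tol=1e-12):
    """Localized Luxemburg norm of f on Q, with f given by equal-measure samples.

    Returns (approximately) inf{lam > 0 : average of young(|f| / lam) <= 1}.
    """
    peak = max(abs(v) for v in values)
    if peak == 0.0:
        return 0.0

    def modular(lam):
        return sum(young(abs(v) / lam) for v in values) / len(values)

    lo, hi = 0.0, peak
    while modular(hi) > 1.0:          # enlarge hi until it is admissible
        hi *= 2.0
    while hi - lo > rel_tol * hi:     # bisection: modular is non-increasing in lam
        mid = 0.5 * (lo + hi)
        if modular(mid) <= 1.0:
            hi = mid
        else:
            lo = mid
    return hi

samples = [1.0, 2.0, 3.0, 4.0]
# With A(t) = t^3 the Luxemburg norm is the normalized L^3 average.
power_norm = luxemburg_norm(samples, lambda t: t ** 3)
# A log bump dominates t^3 pointwise, so its norm is strictly larger.
bump_norm = luxemburg_norm(samples, lambda t: t ** 3 * math.log(math.e + t))
```

The comparison between `power_norm` and `bump_norm` is the discrete counterpart of the statement that the bumped Orlicz norms are slightly larger than the corresponding $L^p$ norms.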

The associate function of $A$ is the Young function

\[
\bar{A}(t) = \sup_{s>0}\{st - A(s)\}.
\]

A Young function $A$ satisfies the $B_p$ condition if for some $c > 0$,

\[
\int_c^\infty \frac{A(t)}{t^p}\,\frac{dt}{t} < \infty.
\]

Important examples of such functions are of the form
$A(t) = t^p\log(e+t)^{-1-\varepsilon}$, $\varepsilon > 0$, which have
associate functions $\bar{A}(t) \approx t^{p'}\log(e+t)^{p'-1+\delta}$,
$\delta > 0$.
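
To see why the power of the logarithm decides membership in $B_p$, here is a short worked computation. It is an illustrative check only, assuming the model function $A(t) = t^p\log(e+t)^{-1-\varepsilon}$ and the choice $c = e$; it uses the elementary bounds $\log t \le \log(e+t) \le 2\log t$ for $t \ge e$ and the substitution $s = \log t$.

```latex
\[
\int_e^\infty \frac{A(t)}{t^p}\,\frac{dt}{t}
  = \int_e^\infty \frac{dt}{t\,\log(e+t)^{1+\varepsilon}}
  \le \int_e^\infty \frac{dt}{t\,(\log t)^{1+\varepsilon}}
  = \int_1^\infty \frac{ds}{s^{1+\varepsilon}}
  = \frac{1}{\varepsilon} < \infty
  \qquad (\varepsilon > 0),
\]
\[
\int_e^\infty \frac{dt}{t\,\log(e+t)}
  \ge \frac12 \int_e^\infty \frac{dt}{t\,\log t}
  = \frac12 \int_1^\infty \frac{ds}{s} = \infty
  \qquad (\varepsilon = 0).
\]
```

So the integral converges exactly when $\varepsilon > 0$, which is the borderline behavior behind the sharpness statements for log bumps.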

Theorem 1.3. Given $p$, $1 < p < \infty$, let $A$ and $B$ be Young
functions such that $\bar{A} \in B_{p'}$ and $\bar{B} \in B_p$. Then
for any pair of weights $(u,v)$ such that

\[
\sup_Q \|u^{1/p}\|_{A,Q}\,\|v^{-1/p}\|_{B,Q} < \infty,
\tag{1.2}
\]

we have that

\[
\|Tf\|_{L^p(u)} \le C\,\|f\|_{L^p(v)},
\tag{1.3}
\]

where $T$ is the Hilbert transform, a Riesz transform, or the
Beurling-Ahlfors operator.

Condition (1.2) is referred to as an $A_p$ bump condition: when
$A(t) = t^p$ and $B(t) = t^{p'}$, we get the two-weight $A_p$
condition. Theorem 1.3 was proved in [5] for the Hilbert transform in
the special case that $A(t) = t^p\log(e+t)^{p-1+\delta}$, $\delta > 0$
(here $\bar{A} \in B_{p'}$), and for Riesz transforms (indeed, for any
Calderón-Zygmund singular integral) given the additional hypothesis
that $p > n$. Examples (see [7, 8]) show that in this particular case
these results are sharp, since they are false in general if we take
$\delta = 0$ (when $\bar{A} \notin B_{p'}$). Theorem 1.3 was proved for
the Hilbert transform and general singular integrals when $p > n$ by
Lerner [34] by combining his decomposition argument with the arguments
in [5].

Two-weight inequalities were first considered by Muckenhoupt [37],
who noted that the same proof as in the one-weight case immediately
shows that for all $p$, $1 < p < \infty$, $(u, v) \in A_p$ if and only
if the maximal operator satisfies the weak $(p,p)$ inequality. However,
Muckenhoupt and Wheeden [38] soon showed that while the two-weight
$A_p$ condition is necessary for the strong $(p,p)$ inequality for the
maximal operator and the strong and weak type inequalities for the
Hilbert transform, it is not sufficient. This led Muckenhoupt and
Wheeden to focus not on the structural or geometric properties of $A_p$
weights but on their relationship to the maximal operator, in
particular, the fact that $w \in A_p$ was necessary and sufficient for
the maximal operator to be bounded on $L^p(w)$ and
$L^{p'}(w^{1-p'})$. They made the following conjecture that is
still open: a sufficient condition for the Hilbert transform to satisfy
the strong $(p,p)$ inequality $H : L^p(v) \to L^p(u)$,
$1 < p < \infty$, is that the maximal operator satisfies the pair of
inequalities

\[
M : L^p(v) \to L^p(u), \qquad
M : L^{p'}(u^{1-p'}) \to L^{p'}(v^{1-p'}).
\tag{1.4}
\]

Bump $A_p$ conditions were first considered by Neugebauer [41] who
showed the following striking result: a pair of weights $(u, v)$
satisfies (1.2) with power bumps $A(t) = t^{rp}$, $B(t) = t^{rp'}$ for
some $r > 1$ if and only if there exist $w \in A_p$ and positive
constants $c_1$, $c_2$ such that $c_1 u(x) \le w(x) \le c_2 v(x)$.
From this condition we immediately get a large number of two-weight
norm inequalities as corollaries to the analogous one-weight results.
In particular, we get the two inequalities (1.4). An immediate question
was whether this condition could be weakened and still get that the
maximal operator satisfies $M : L^p(v) \to L^p(u)$. This was answered
in [43], where it was shown that a sufficient condition for (1.4) was
that the pair of weights satisfies (1.2) with $\bar{A} \in B_{p'}$ and
$\bar{B} \in B_p$. The centrality of these $B_p$ conditions is shown by
the fact that they are sharp within the scale of Orlicz bumps as shown
in [43]. This led naturally to the following version of the conjecture
of Muckenhoupt and Wheeden: a sufficient condition on the pair of
weights $(u, v)$ for any singular integral to satisfy
$T : L^p(v) \to L^p(u)$ is that (1.2) holds. Progress on this
conjecture was made in [11, 5, 34]. Theorem 1.3 completely solves it
for the Hilbert and Riesz transforms and the Beurling-Ahlfors operator,
and as we noted above it is the best possible result in the scale of
$B_p$ bumps. See [7] for further details and references on this topic.

In the past decade, a great deal of attention has been focused on
proving that "testing conditions" are necessary and sufficient for
two-weight norm inequalities for singular integrals. (See Nazarov,
Treil and Volberg [39, 53, 40] and the recent preprints by Lacey,
Sawyer and Uriarte-Tuero [29, 30].) More precisely, given a singular
integral $T$, it is conjectured that $T : L^p(v) \to L^p(u)$ if and
only if for every cube $Q$,



The necessity of these conditions is immediate. The best known results
are for $p = 2$; partial results (with additional hypotheses) are known
for other values of $p$. These results are of great interest not only
because of the elegance of this conjecture but also because of their
connection with Tb-theorems on non-homogeneous spaces (see [53] and the
references it contains).

Testing conditions and $A_p$ bump conditions are not readily
comparable: they represent two fundamentally different approaches to
the two-weight problem. While both approaches are important, we believe
that bump conditions have several advantages over testing conditions.
First, they are universal, geometric conditions: they are independent
of the operators and any pair yields norm inequalities for a range of
operators. Second, they are much easier to check than the testing
conditions, and it is very easy to construct examples of weights that
do and do not satisfy a given bump condition. (For many examples and a
general technique for constructing them, see [7].) Third, they are not
tied to $L^2$, unlike testing conditions where the transition from
$p = 2$ to all $p$ has proved to be very difficult. (In this regard, we
note that in [39] it was claimed, without proof, that in the specific
case they were considering, testing conditions were not sufficient.)

Maximal singular integrals. Given a singular integral $T$ with
convolution kernel $K$, recall that the associated maximal singular
integral is defined by

\[
T_*f(x) = \sup_{\varepsilon > 0}
   \Big|\int_{|y| > \varepsilon} K(y)\,f(x-y)\,dy\Big|.
\]

Somewhat surprisingly, both Theorem 1.1 and Theorem 1.3 remain true
if the singular integral is replaced by the associated maximal singular
integral.

Theorem 1.4. Given $p$, $1 < p < \infty$, and $w \in A_p$, then
inequality (1.1) holds if $T$ is replaced by $T_*$, where $T$ is the
Hilbert transform, a Riesz transform or the Beurling-Ahlfors operator.
Similarly, if the pair $(u, v)$ satisfies (1.2), then inequality (1.3)
holds if $T$ is replaced by $T_*$.

In the one-weight case, Theorem 1.4 was proved very recently by
Hytönen et al. [23]. Their proof used a very general family of
"maximal" dyadic shift operators and a characterization of the
two-weight norm inequalities for maximal singular integrals due to
Lacey, Sawyer and Uriarte-Tuero [29]. In the two-weight case this
result is new. In both the one and two-weight case our approach is to
prove the corresponding result for the associated maximal dyadic shift
operators.

Remark 1.5. Very recently, Lerner [35] has proved Theorem 1.4 in the
one-weight case for general Calderón-Zygmund maximal singular integral
operators when $p > 3$.

Dyadic paraproducts and constant Haar multipliers. Let $\Delta$
denote the collection of dyadic cubes in $\mathbb{R}$. We consider two
operators defined on the real line. A function $b$ is in dyadic BMO, we
write $b \in BMO^d$, if

\[
\|b\|_{*,d} = \sup_{I \in \Delta}
   \Big(\fint_I |b(x) - b_I|^2\,dx\Big)^{1/2} < \infty,
\]

where $b_I = \fint_I b(x)\,dx$. Given a dyadic interval $I$, $I_+$ and
$I_-$ are its right and left halves, and the Haar function $h_I$ is
defined by

\[
h_I(x) = |I|^{-1/2}\big(\chi_{I_-}(x) - \chi_{I_+}(x)\big).
\]

Define the dyadic paraproduct $\pi_b$ by

\[
\pi_b f(x) = \sum_{I \in \Delta} f_I\,\langle b, h_I\rangle\,h_I(x).
\]

For an overview of the history and properties of the dyadic
paraproduct, we refer the reader to Pereyra [42].

Theorem 1.6. Given a function $b \in BMO^d$, and $p$,
$1 < p < \infty$, then for all $w \in A_p$,

\[
\|\pi_b f\|_{L^p(w)} \le C_p\,\|b\|_{*,d}\,
   [w]_{A_p}^{\max(1,\frac{1}{p-1})}\,\|f\|_{L^p(w)}.
\]

Furthermore, given a pair $(u,v)$ that satisfies (1.2), then

\[
\|\pi_b f\|_{L^p(u)} \le C\,\|b\|_{*,d}\,\|f\|_{L^p(v)}.
\]

In the one-weight case, Theorem 1.6 was first proved by Beznosova [2]
using Bellman function techniques. A different proof that avoided
Bellman functions but used two-weight inequalities was given in [23].

Given a sequence $\alpha = \{\alpha_I\}_{I \in \Delta} \in \ell^\infty$,
define the constant Haar multiplier $T_\alpha$ by

\[
T_\alpha f(x) = \sum_{I \in \Delta} \alpha_I\,\langle f, h_I\rangle\,h_I(x).
\]

If $\alpha_I = 1$, then $T_\alpha$ is the identity operator. For more
on the properties of these operators, see Pereyra [42]. The analog of
Theorem 1.6 is true for constant Haar multipliers.
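
The action of a constant Haar multiplier can be checked on a finite model. The sketch below is an illustration under stated assumptions, not the authors' construction: it works on $[0,1)$ split into $2^N$ equal cells, uses only the Haar functions of dyadic intervals inside $[0,1)$ that contain at least two cells, and the names `haar_multiplier` and `alpha(size, k)` are hypothetical. In this truncated setting the choice $\alpha_I = 1$ reproduces $f$ minus its mean rather than $f$ itself, because the constant function is not in the span of the Haar functions used.

```python
def haar_multiplier(f, alpha):
    """Apply T_alpha to f, given as samples on 2**N equal cells of [0, 1).

    alpha(size, k) is the coefficient attached to the dyadic interval made of
    cells k*size, ..., (k+1)*size - 1.
    """
    n = len(f)
    dx = 1.0 / n
    out = [0.0] * n
    size = n
    while size >= 2:
        half = size // 2
        height = (size * dx) ** -0.5              # |I|^{-1/2}
        for k in range(n // size):
            start = k * size
            # <f, h_I> with h_I = |I|^{-1/2} (chi_{I_-} - chi_{I_+})
            left = sum(f[start:start + half])
            right = sum(f[start + half:start + size])
            coeff = height * (left - right) * dx
            a = alpha(size, k)
            for i in range(start, start + half):
                out[i] += a * coeff * height      # h_I > 0 on the left half
            for i in range(start + half, start + size):
                out[i] -= a * coeff * height      # h_I < 0 on the right half
        size = half
    return out

f = [3.0, -1.0, 4.0, 1.0, -5.0, 9.0, 2.0, -6.0]
mean = sum(f) / len(f)
identity = haar_multiplier(f, lambda size, k: 1.0)
signs = haar_multiplier(f, lambda size, k: -1.0 if k % 2 else 1.0)
```

With coefficients $\pm 1$ the unweighted $L^2$ norm is preserved by orthonormality of the Haar system; the content of Theorem 1.7 is what happens once a weight is introduced.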

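Theorem 1.7. Given a sequence
$\alpha = \{\alpha_I\}_{I \in \Delta} \in \ell^\infty$ and $p$,
$1 < p < \infty$, then for all $w \in A_p$,

\[
\|T_\alpha f\|_{L^p(w)} \le C_p\,\|\alpha\|_{\ell^\infty}\,
   [w]_{A_p}^{\max(1,\frac{1}{p-1})}\,\|f\|_{L^p(w)}.
\]

Furthermore, given a pair $(u,v)$ that satisfies (1.2), then

\[
\|T_\alpha f\|_{L^p(u)} \le C\,\|\alpha\|_{\ell^\infty}\,\|f\|_{L^p(v)}.
\]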

In the special case when $\alpha_I = \pm 1$, Theorem 1.7 was proved by
Wittwer [56].

Dyadic square functions. Let $\Delta$ denote the collection of dyadic
cubes in $\mathbb{R}^n$. Given $Q \in \Delta$, let $\hat{Q}$ be its
dyadic parent: the unique dyadic cube containing $Q$ whose side-length
is twice that of $Q$. The dyadic square function is the operator

\[
S_d f(x) = \Big(\sum_{Q \in \Delta} |f_Q - f_{\hat{Q}}|^2\,\chi_Q(x)\Big)^{1/2},
\]

where $f_Q = \fint_Q f(x)\,dx$. For the properties of the dyadic square
function we refer the reader to Wilson [55].
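
A finite one-dimensional model makes the definition of the dyadic square function easy to experiment with. The sketch below is illustrative only: it assumes $f$ is given by samples on $2^N$ equal cells of $[0,1)$ and sums only over dyadic intervals whose parent lies inside $[0,1)$ (a truncation of the full sum); the function name is hypothetical.

```python
def dyadic_square_function(f):
    """Truncated dyadic square function of f sampled on 2**N cells of [0, 1)."""
    n = len(f)
    total = [0.0] * n
    size = 1                                   # number of cells in the child interval
    while size < n:
        for k in range(n // (2 * size)):
            start = 2 * k * size
            parent_avg = sum(f[start:start + 2 * size]) / (2 * size)
            for child_start in (start, start + size):
                child_avg = sum(f[child_start:child_start + size]) / size
                for i in range(child_start, child_start + size):
                    total[i] += (child_avg - parent_avg) ** 2
        size *= 2
    return [t ** 0.5 for t in total]

f = [3.0, -1.0, 4.0, 1.0]
s = dyadic_square_function(f)
mean_square = sum(v * v for v in s) / len(s)   # integral of (S_d f)^2 over [0, 1)
```

For this truncated model the unweighted identity $\|S_d f\|_{L^2}^2 = \|f - f_{[0,1)}\|_{L^2}^2$ holds by orthogonality of martingale differences; in the example both sides equal 3.6875.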

Theorem 1.8. Given $p$, $1 < p < \infty$, then for any $w \in A_p$,

\[
\|S_d f\|_{L^p(w)} \le C_{n,p}\,
   [w]_{A_p}^{\max(\frac12,\frac{1}{p-1})}\,\|f\|_{L^p(w)}.
\]

Further, the exponent $\max(\frac12, \frac{1}{p-1})$ is the best
possible.

The exponent in Theorem 1.8 was first conjectured by Lerner [31]
for the continuous square function; he also showed it was the best
possible. In [33] he proved that for $p > 2$ the sharp exponent is at
most $p'/2 > \max(\frac12, \frac{1}{p-1})$. When $p = 2$, Theorem 1.8
was proved by Wittwer [57] and by Hukovic, Treil and Volberg [22]; this
was extended to $p < 2$ by extrapolation in [12] and examples were
given to show that in this range the exponent is the best possible.

Remark 1.9. Very recently, Lerner [35] proved the analog of Theo- 
rem 1.8 for continuous square functions. His proof uses the intrinsic 
square function introduced by Wilson [55]. 

Theorem 1.10. Fix $p$, $1 < p < \infty$. Suppose $1 < p \le 2$, and
$B$ is a Young function such that $\bar{B} \in B_p$. If the pair
$(u, v)$ satisfies

\[
\sup_Q \|u^{1/p}\|_{p,Q}\,\|v^{-1/p}\|_{B,Q} < \infty,
\tag{1.5}
\]

then

\[
\|S_d f\|_{L^p(u)} \le C\,\|f\|_{L^p(v)}.
\tag{1.6}
\]

Suppose $2 < p < \infty$, and $A$ and $B$ are Young functions such that
$\bar{A} \in B_{(p/2)'}$ and $\bar{B} \in B_p$. If the pair $(u, v)$
satisfies

\[
\sup_Q \|u^{2/p}\|_{A,Q}^{1/2}\,\|v^{-1/p}\|_{B,Q} < \infty,
\tag{1.7}
\]

then (1.6) holds.

Condition (1.5) is the same condition for the maximal operator to
map $L^p(v)$ to $L^p(u)$, whereas condition (1.7) is more similar to
the conditions needed for singular integrals, but with a smaller "bump"
on the left. This is easier to see in the scale of "log bumps." As we
noted above, for a singular integral we need to take
$A(t) = t^p\log(e+t)^{p-1+\delta}$, $\delta > 0$, but for the dyadic
square function it suffices to take
$A(t) = t^{p/2}\log(e+t)^{p/2-1+\delta}$, which after rescaling leads
to $t^p\log(e+t)^{p/2-1+\delta}$. This difference in the behavior of
the dyadic square function depending on whether $p < 2$ or $p > 2$ was
first noted in [7]. There we conjectured Theorem 1.10 was true and
proved it in some special cases. Furthermore, we proved in [7] that
this result is sharp in the scale of log bumps: if we take $p > 2$ and
let $A(t) = t^{p/2}\log(e+t)^{p/2-1+\delta}$, then the theorem holds
for $\delta > 0$, when $\bar{A} \in B_{(p/2)'}$, but not for
$\delta = 0$, when $\bar{A} \notin B_{(p/2)'}$.

Remark 1.11. Theorem 1.8 remains true if we replace the Ap condition 
by the dyadic Ap condition (i.e., defined only with respect to dyadic 
cubes). Similarly, Theorem 1.10 remains true if the weight conditions 
are restricted to dyadic cubes. For both theorems this follows by ex- 
amining the proofs and details are left to the interested reader. 

The vector-valued maximal operator. Let $M$ be the Hardy-Littlewood
maximal operator. Given a vector-valued function $f = \{f_i\}$, and
$q$, $1 < q < \infty$, define the vector-valued maximal operator
$\overline{M}_q$ by

\[
\overline{M}_q f(x) = \Big(\sum_i Mf_i(x)^q\Big)^{1/q}.
\]

The vector-valued maximal operator was introduced by C. Fefferman
and Stein [14]; for more information see [18].

Similar to the dyadic square function, the behavior of the vector- 
valued maximal operator depends on the relative sizes of p and q. 

Theorem 1.12. Fix $q$, $1 < q < \infty$. Given $p$, $1 < p < \infty$,
if $w \in A_p$, then

\[
\|\overline{M}_q f\|_{L^p(w)} \le C_{n,p,q}\,
   [w]_{A_p}^{\max(\frac1q,\frac{1}{p-1})}
   \Big(\int_{\mathbb{R}^n} \|f(x)\|_{\ell^q}^p\,w(x)\,dx\Big)^{1/p}.
\]

Further, the exponent $\max(\frac1q, \frac{1}{p-1})$ is the best
possible.

Theorem 1.12 is new. A slightly worse bound than Theorem 1.12
was implicit in the literature, but does not appear to have been stated
explicitly. To get this weaker estimate, note that when $q = p$, by the
sharp result for the Hardy-Littlewood maximal operator we have that

\[
\|\overline{M}_q f\|_{L^q(w)} \le C_{n,q}\,[w]_{A_q}^{\frac{1}{q-1}}
   \Big(\int_{\mathbb{R}^n} \|f(x)\|_{\ell^q}^q\,w(x)\,dx\Big)^{1/q}.
\]

Then by adapting to this context the sharp version of the Rubio de
Francia extrapolation theorem, we get the exponent $\frac{1}{q-1}$ if
$p > q$ and $\frac{1}{p-1}$ if $p < q$. Theorem 1.12 improves this
bound for $p$ large.

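Theorem 1.13. Fix $q$, $1 < q < \infty$. Suppose $1 < p \le q$, and
$B$ is a Young function such that $\bar{B} \in B_p$. If the pair
$(u, v)$ satisfies

\[
\sup_Q \|u^{1/p}\|_{p,Q}\,\|v^{-1/p}\|_{B,Q} < \infty,
\tag{1.8}
\]

then

\[
\|\overline{M}_q f\|_{L^p(u)} \le C
   \Big(\int_{\mathbb{R}^n} \|f(x)\|_{\ell^q}^p\,v(x)\,dx\Big)^{1/p}.
\tag{1.9}
\]

Suppose $q < p < \infty$, and $A$ and $B$ are Young functions such that
$\bar{A} \in B_{(p/q)'}$ and $\bar{B} \in B_p$. If the pair $(u,v)$
satisfies

\[
\sup_Q \|u^{q/p}\|_{A,Q}^{1/q}\,\|v^{-1/p}\|_{B,Q} < \infty,
\tag{1.10}
\]

then (1.9) holds.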

When $p > q$, Theorem 1.13 is sharp in the scale of log bumps. The
example in [9] shows that if $A(t) = t^{p/q}\log(e+t)^{p/q-1+\delta}$,
then the result fails if $\delta = 0$, when
$\bar{A} \notin B_{(p/q)'}$. (If $\delta > 0$,
$\bar{A} \in B_{(p/q)'}$.) In the case $p \le q$ Theorem 1.13 is not
new: it was first proved in [44]; for a different proof see [7]. We
include it here for completeness and to highlight the similarity to the
dyadic square function. The case $p > q$ is new; it was first
conjectured in [7] where a few special cases were proved.

Remark 1.14. To obtain Theorems 1.12 and 1.13 we first consider the
corresponding dyadic vector-valued maximal operator and establish
both results for it. In this case, as observed before for the dyadic
square function, we can replace the $A_p$ condition by the dyadic $A_p$
condition, and in (1.8), (1.10) the sup can be taken over all dyadic
cubes. Further details are left to the interested reader.

Organization. The remainder of this paper is organized as follows. 
In Section 2 we gather some basic results, primarily about weighted 
norm inequalities, that are needed in subsequent sections. In Section 3 
we give some preliminary material about the local mean oscillation of a 
function and state the decomposition theorem of Lerner. In Section 4
we define the Haar shift operators and prove the key estimate we need
to apply Lerner's results. In Sections 5-8 we prove our main results. 
In Section 5 we prove our results for singular integrals by proving the 
corresponding results for Haar shift operators. As a corollary to these 
theorems we get our results for dyadic paraproducts and constant Haar 
multipliers. After the proof of Theorems 1.1 and 1.3 we will briefly dis- 
cuss the technical obstructions which prevent us from applying our 
approach directly to a Calderon-Zygmund singular integral. The proof 
for maximal singular integrals is very similar to the proof for singular 
integrals, but since we introduce a new family of dyadic operators we 
give these results in Section 6. The proofs for square functions and 
vector-valued maximal operators are also very similar to those for sin- 
gular integrals, so we will only sketch the proofs of these results in 
Sections 7 and 8, highlighting the key changes. 

We would like to thank Andrei Lerner and Michael Wilson for a 
clarifying discussion about the results in Section 3. 



2. Preliminary results 

In this section we state some basic results that we will need in 
our proofs. The first is the sharp one-weight bound for the Hardy- 
Littlewood maximal operator. This result is due to Buckley [3]; for an 
elementary proof, see Lerner [32]. 

Theorem 2.1. Given $p$, $1 < p < \infty$, and any $w \in A_p$,

\[
\|Mf\|_{L^p(w)} \le C_{n,p}\,[w]_{A_p}^{\frac{1}{p-1}}\,\|f\|_{L^p(w)}.
\]

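The next result is the sharp version of the Rubio de Francia
extrapolation theorem due to Dragičević et al. [12].

Theorem 2.2. Suppose that for some $p_0$, $1 < p_0 < \infty$, there
exists $\alpha(p_0) > 0$ such that for every $w \in A_{p_0}$, a
sublinear operator $T$ satisfies

\[
\|Tf\|_{L^{p_0}(w)} \le C_{n,T,p_0}\,[w]_{A_{p_0}}^{\alpha(p_0)}\,\|f\|_{L^{p_0}(w)}.
\]

Then for every $p$, $1 < p < \infty$,

\[
\|Tf\|_{L^p(w)} \le C_{n,T,p_0,p}\,
   [w]_{A_p}^{\alpha(p_0)\max(1,\frac{p_0-1}{p-1})}\,\|f\|_{L^p(w)}.
\]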
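
The exponent bookkeeping in sharp extrapolation is simple arithmetic, and the sketch below records it. It assumes the standard form of the conclusion of Theorem 2.2, namely that a bound with exponent $\alpha(p_0)$ at $p_0$ extrapolates to exponent $\alpha(p_0)\max(1, \frac{p_0-1}{p-1})$ at $p$; the function name is hypothetical and the starting exponents used below are examples, not claims about how each theorem in the paper is proved.

```python
def extrapolated_exponent(alpha_p0, p0, p):
    """Exponent of [w]_{A_p} obtained at p from exponent alpha_p0 at p0."""
    return alpha_p0 * max(1.0, (p0 - 1.0) / (p - 1.0))

# Starting from a linear bound at p0 = 2 one recovers max(1, 1/(p-1)).
from_two_small_p = extrapolated_exponent(1.0, 2.0, 1.5)   # p = 3/2
from_two_large_p = extrapolated_exponent(1.0, 2.0, 4.0)   # p = 4

# Starting from exponent 1/2 at p0 = 3 one gets max(1/2, 1/(p-1)).
from_three = extrapolated_exponent(0.5, 3.0, 2.0)         # p = 2

# Starting from exponent 1/(q-1) at p0 = q one gets 1/(q-1) for p > q
# and 1/(p-1) for p < q, the weaker bound mentioned after Theorem 1.12.
q = 2.0
vector_large_p = extrapolated_exponent(1.0 / (q - 1.0), q, 5.0)
vector_small_p = extrapolated_exponent(1.0 / (q - 1.0), q, 1.5)
```

Note that for $q = 2$ and $p = 5$ the extrapolated exponent is 1, whereas $\max(\frac1q, \frac{1}{p-1}) = \frac12$, which illustrates the improvement for large $p$.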

Third, we need a norm inequality for a weighted dyadic maximal
operator.

Lemma 2.3. Let $\sigma$ be a locally integrable function such that
$\sigma > 0$ a.e., and define the weighted dyadic maximal operator

\[
M^d_\sigma f(x) = \sup_{Q \in \Delta,\ x \in Q}
   \frac{1}{\sigma(Q)} \int_Q |f(y)|\,\sigma(y)\,dy.
\]

Then for all $p$, $1 < p < \infty$,

\[
\|M^d_\sigma f\|_{L^p(\sigma)} \le p'\,\|f\|_{L^p(\sigma)}.
\]

In particular, the constant is independent of $\sigma$.

This result is well known and follows by standard arguments. Clearly,
$M^d_\sigma$ is bounded on $L^\infty(\sigma)$ with constant 1. It is
also of weak-type $(1, 1)$ (with respect to $\sigma$) with constant 1;
this follows from the dyadic structure. Then by Marcinkiewicz
interpolation we get the desired estimate (see [20, Chapter 1,
Exercise 1.3.3]).
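
The bound in Lemma 2.3 can be observed numerically in a finite model. The sketch below is an illustration under stated assumptions: $f$ and $\sigma$ are piecewise constant on $2^N$ equal cells of $[0,1)$, the supremum is taken over dyadic intervals inside $[0,1)$ only, and the function names and sample data are made up for the example.

```python
def weighted_dyadic_maximal(f, sigma):
    """Weighted dyadic maximal function on 2**N cells, dyadic intervals in [0, 1)."""
    n = len(f)
    best = [0.0] * n
    size = 1
    while size <= n:
        for start in range(0, n, size):
            block = range(start, start + size)
            mass = sum(sigma[i] for i in block)                  # sigma(Q) up to 1/n
            avg = sum(abs(f[i]) * sigma[i] for i in block) / mass
            for i in block:
                best[i] = max(best[i], avg)
        size *= 2
    return best

def weighted_lp_norm(g, sigma, p):
    """L^p(sigma) norm on [0, 1) of a function constant on each cell."""
    n = len(g)
    return (sum(abs(g[i]) ** p * sigma[i] for i in range(n)) / n) ** (1.0 / p)

f = [5.0, 0.0, -2.0, 1.0, 0.0, 0.0, 7.0, -3.0]
sigma = [0.1, 4.0, 1.0, 2.5, 0.3, 9.0, 0.2, 1.0]
p = 3.0
p_conj = p / (p - 1.0)
mf = weighted_dyadic_maximal(f, sigma)
lhs = weighted_lp_norm(mf, sigma, p)
rhs = p_conj * weighted_lp_norm(f, sigma, p)
```

Changing `sigma` to any other positive data leaves the inequality `lhs <= rhs` intact, reflecting the fact that the constant $p'$ does not depend on $\sigma$.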

In the two-weight case we will need a norm inequality for Orlicz 
maximal operators. This result was proved in [43]. 

Theorem 2.4. Given $p$, $1 < p < \infty$, suppose that $A$ is a Young
function such that $A \in B_p$. Then the Orlicz maximal operator

\[
M_A f(x) = \sup_{Q \ni x} \|f\|_{A,Q}
\]

is bounded on $L^p(\mathbb{R}^n)$.

We will also need a two-weight norm inequality for the maximal
operator, also from [43].

Theorem 2.5. Given $p$, $1 < p < \infty$, suppose that $B$ is a Young
function such that $\bar{B} \in B_p$. Then for any pair $(u, v)$ that
satisfies

\[
\sup_Q \|u^{1/p}\|_{p,Q}\,\|v^{-1/p}\|_{B,Q} < \infty,
\]

we have that

\[
\|Mf\|_{L^p(u)} \le C\,\|f\|_{L^p(v)}.
\]

To apply Theorem 2.5 we will need to use two facts about Young
functions and Orlicz norms. First, if $A$ is a Young function such that
$\bar{A} \in B_{p'}$, then $\bar{A}(t) \le Ct^{p'}$ for $t \ge 1$, so
that $t^p \le A(Ct)$ for $t \ge 1$, and therefore,
$\|u^{1/p}\|_{p,Q} \le C\,\|u^{1/p}\|_{A,Q}$. Second, given a Young
function $A$ we have the generalized Hölder's inequality,

\[
\fint_Q |f(x)g(x)|\,dx \le 2\,\|f\|_{A,Q}\,\|g\|_{\bar{A},Q}.
\]

See [7] for more details.

3. Local mean oscillation

To state Lerner's decomposition argument we must first make some
definitions and give a few basic results. We follow the terminology and
notation in [34], which in turn is based on Fujii [16, 17] and Jawerth
and Torchinsky [27]. We note in passing that many of the underlying
ideas originated in the work of Carleson [4] and Garnett and Jones [19].

Hereafter we assume that all functions $f$ are measurable and
finite-valued almost everywhere. Given a cube $Q$ and $\lambda$,
$0 < \lambda < 1$, define the local mean oscillation of $f$ on $Q$ by

\[
\omega_\lambda(f, Q) = \inf_{c \in \mathbb{R}}
   \big((f - c)\chi_Q\big)^*(\lambda|Q|),
\]

where $f^*$ is the (left-continuous) non-increasing rearrangement of
$f$:

\[
f^*(t) = \inf\{\alpha > 0 : |\{x \in \mathbb{R}^n : |f(x)| > \alpha\}| < t\}.
\]

The local sharp maximal function of $f$ relative to $Q$ is then defined
by

\[
M^{\#}_{\lambda,Q} f(x) = \sup_{Q' \ni x,\ Q' \subset Q} \omega_\lambda(f, Q').
\]

The local sharp maximal function is significantly smaller than the
C. Fefferman-Stein sharp maximal function: for all $\lambda > 0$
sufficiently small,
$M(M^{\#}_{\lambda,Q} f)(x) \le C(n, \lambda)\,M^{\#} f(x)$. (See [27].)

A median value of $f$ on $Q$ is a (possibly not unique) number
$m_f(Q)$ such that

\[
\max\big(|\{x \in Q : f(x) > m_f(Q)\}|,\ |\{x \in Q : f(x) < m_f(Q)\}|\big)
   \le |Q|/2.
\]

(A different but functionally equivalent definition is given in [17].)
The median plays the same role for the local sharp maximal function as
the mean does for the C. Fefferman-Stein sharp maximal function. More
precisely, for each $\lambda$, $0 < \lambda \le 1/2$,

\[
\omega_\lambda(f, Q) \le \big((f - m_f(Q))\chi_Q\big)^*(\lambda|Q|)
   \le 2\,\omega_\lambda(f, Q).
\]

(The first inequality is immediate; the second follows from (3.3)
below and the fact that for any constant $c$,
$m_f(Q) - c = m_{f-c}(Q)$; see Lerner [36].)
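
These definitions become concrete for a function that takes finitely many values on equal-measure pieces of $Q$. The sketch below is an illustration under that assumption (the function names are hypothetical): with $N$ samples, $(f\chi_Q)^*(\lambda|Q|)$ is the $\lceil \lambda N\rceil$-th largest value of $|f|$, and the infimum over $c$ in $\omega_\lambda(f,Q)$ is half the length of the shortest window containing $N - \lceil\lambda N\rceil + 1$ of the sorted samples.

```python
import math

def rearrangement_at(values, lam):
    """(f chi_Q)^*(lam |Q|) for f given by equal-measure samples on Q."""
    ordered = sorted((abs(v) for v in values), reverse=True)
    m = math.ceil(lam * len(ordered))
    return ordered[m - 1]

def median(values):
    """One admissible median value m_f(Q) of the samples."""
    s = sorted(values)
    n = len(s)
    return s[n // 2] if n % 2 else 0.5 * (s[n // 2 - 1] + s[n // 2])

def local_mean_oscillation(values, lam):
    """omega_lam(f, Q): inf over c of ((f - c) chi_Q)^*(lam |Q|)."""
    s = sorted(values)
    n = len(s)
    m = math.ceil(lam * n)
    width = n - m + 1                 # samples that must lie within r of c
    return min(s[i + width - 1] - s[i] for i in range(m)) / 2.0

vals = [0.3, -1.2, 2.5, 0.9, -0.4, 4.1, 1.7, -2.2]
lam = 0.25
med = median(vals)
osc = local_mean_oscillation(vals, lam)
centered = rearrangement_at([v - med for v in vals], lam)
```

For these samples `osc` is 2.35 and `centered` is about 2.8, so the two-sided comparison between the oscillation and the median-centered rearrangement displayed above can be seen directly.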

To estimate the median and the local mean oscillation we need sev- 
eral properties. For the convenience of the reader we gather them as a 
lemma and sketch their proofs. 

Lemma 3.1. Given a measurable function $f$ and a cube $Q$, then for
all $\lambda$, $0 < \lambda < 1$, and $p$, $0 < p < \infty$,

\[
(f\chi_Q)^*(\lambda|Q|) \le \lambda^{-1/p}\,
   \|f\|_{L^{p,\infty}(Q,\,|Q|^{-1}dx)},
\tag{3.1}
\]

\[
(f\chi_Q)^*(\lambda|Q|) \le
   \Big(\frac{1}{\lambda|Q|}\int_Q |f(x)|^p\,dx\Big)^{1/p}.
\tag{3.2}
\]

Furthermore,

\[
|m_f(Q)| \le (f\chi_Q)^*(|Q|/2);
\tag{3.3}
\]

in particular, if $f \in L^p$ for any $p > 0$, then $m_f(Q) \to 0$ as
$|Q| \to \infty$.

Proof. Inequality (3.2) follows immediately from (3.1). To prove this
inequality, fix $\alpha < (f\chi_Q)^*(\lambda|Q|)$. Then

\[
\lambda^{-1/p}\,\|f\|_{L^{p,\infty}(Q,\,|Q|^{-1}dx)}
  \ge \lambda^{-1/p}\,|Q|^{-1/p}\,\alpha\,
      |\{x \in Q : |f(x)| > \alpha\}|^{1/p} \ge \alpha.
\]

Since this is true for all such $\alpha$, (3.1) follows at once.

To prove (3.3) we consider two cases. Suppose $m_f(Q) \ge 0$. Define

\[
m_+ = \sup\{\beta \ge m_f(Q) : |\{x \in Q : f(x) < \beta\}| \le |Q|/2\};
\]

then $0 \le m_f(Q) \le m_+$ so it will be enough to prove
$m_+ \le (f\chi_Q)^*(|Q|/2)$. Take any $\alpha > 0$ such that

\[
|\{x \in Q : |f(x)| > \alpha\}| < |Q|/2.
\]

Then

\[
|\{x \in Q : f(x) \le \alpha\}| \ge |\{x \in Q : |f(x)| \le \alpha\}| > |Q|/2.
\]

Hence, for any $\beta > \alpha$,

\[
|\{x \in Q : f(x) < \beta\}| \ge |\{x \in Q : f(x) \le \alpha\}| > |Q|/2,
\]

and so $\beta > m_+$. Since this is true for all $\beta > \alpha$, we
have that $\alpha \ge m_+$, and taking the infimum of all such $\alpha$
we get that $(f\chi_Q)^*(|Q|/2) \ge m_+$.

Finally if $m_f(Q) < 0$, define $g(x) = -f(x)$. Then $-m_f(Q)$ is
a median of $g$ and the previous case yields
$|m_f(Q)| = -m_f(Q) \le (g\chi_Q)^*(|Q|/2) = (f\chi_Q)^*(|Q|/2)$. □

Remark 3.2. Inequality (3.1) is central to our proofs as it allows us 
to use weak (1,1) inequalities directly in our estimates. By way of 
comparison, in [5] a key technical difficulty resulted from having to use 
Kolmogorov's inequality rather than the weak (1,1) inequality for a 
singular integral. Overcoming this is the reason the results there were 
limited to log bumps. 

To state Lerner's decomposition theorem, we generalize our notation
slightly: given a cube $Q_0$, let $\Delta(Q_0)$ be the collection of
dyadic cubes relative to $Q_0$. Given $Q \in \Delta(Q_0)$,
$Q \ne Q_0$, let $\hat{Q}$ be its dyadic parent: the unique cube in
$\Delta(Q_0)$ containing $Q$ whose side-length is twice that of $Q$.

Theorem 3.3. ([34]) Given a measurable function $f$ and a cube $Q_0$,
for each $k \ge 1$ there exists a (possibly empty) collection of
pairwise disjoint cubes $\{Q_j^k\} \subset \Delta(Q_0)$ such that if
$\Omega_k = \bigcup_j Q_j^k$, then $\Omega_{k+1} \subset \Omega_k$
and $|\Omega_{k+1} \cap Q_j^k| \le \frac12 |Q_j^k|$. Furthermore, for
almost every $x \in Q_0$,

\[
|f(x) - m_f(Q_0)| \le 4\,M^{\#}_{1/4,Q_0} f(x)
   + 4 \sum_{k,j} \omega_{1/2^{n+2}}(f, \hat{Q}_j^k)\,\chi_{Q_j^k}(x).
\]

Remark 3.4. If for all $j$ and $k$ we define
$E_j^k = Q_j^k \setminus \Omega_{k+1}$, then the sets $E_j^k$ are
pairwise disjoint and $|E_j^k| \ge \frac12 |Q_j^k|$.
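
Both claims in Remark 3.4 can be verified in a few lines. The check below is a sketch that uses only two properties from Theorem 3.3: the nesting $\Omega_{k+1} \subset \Omega_k$ and the estimate $|\Omega_{k+1} \cap Q_j^k| \le \frac12|Q_j^k|$, together with the notation $E_j^k = Q_j^k \setminus \Omega_{k+1}$.

```latex
% Size: remove from Q_j^k the part covered by the next generation.
\[
|E_j^k| = |Q_j^k| - |Q_j^k \cap \Omega_{k+1}|
        \ge |Q_j^k| - \tfrac12 |Q_j^k| = \tfrac12 |Q_j^k|.
\]
% Disjointness within one generation: E_j^k \subset Q_j^k and the cubes
% Q_j^k are pairwise disjoint for fixed k.
% Disjointness across generations k < l: by nesting,
\[
E_i^l \subset \Omega_l \subset \Omega_{k+1},
\qquad
E_j^k \cap \Omega_{k+1} = \emptyset,
\qquad\text{so}\qquad E_i^l \cap E_j^k = \emptyset .
\]
```

This is the property that lets sums over the cubes $Q_j^k$ be compared with integrals over the disjoint sets $E_j^k$.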

Remark 3.5. Though it is not explicit in [34], it follows at once from
the proof that we can replace $M^{\#}_{1/4,Q_0}$ by the corresponding
dyadic operator $M^{\#,d}_{1/4,Q_0}$, where

\[
M^{\#,d}_{\lambda,Q} f(x) = \sup_{x \in Q' \in \Delta(Q)} \omega_\lambda(f, Q').
\]

Intuitively, one may think of the cubes $\{Q_j^k\}$ as being the analog
of the Calderón-Zygmund cubes for the function $f - m_f(Q_0)$ but
defined with respect to the median instead of the mean. The cubes
$Q_j^k$ are maximal dyadic cubes with respect to a dyadic local sharp
maximal operator. The terms on the right-hand side of the above
inequality then play a role like that of the good and bad parts of the
Calderón-Zygmund decomposition. A key difference, of course, is that
while the Calderón-Zygmund decomposition is done at one "scale," the
above theorem requires that we estimate the local mean oscillation of
$f$ at all scales.



4. The Haar shift operators 

To prove Theorems 1.1 and 1.3 we need to prove the corresponding 
inequalities for certain dyadic operators that can be used to approx- 
imate the Hilbert transform, the Riesz transforms and the Beurling- 
Ahlfors operator. We follow the approach used in [28] and consider si- 
multaneously a family of dyadic operators — the Haar shift operators — 
that contains all the operators we are interested in. 

Let $\Delta$ be the set of dyadic cubes in $\mathbb{R}^n$. For our arguments we properly need to consider the sets $\Delta_{s,t}$, $s \in \mathbb{R}^n$, $t > 0$, of translations and dilations of dyadic cubes. However, it will be immediate that all of our arguments for dyadic cubes extend to these more general families, so without loss of generality we will restrict ourselves to dyadic cubes.

We define a Haar function on a cube $Q \in \Delta$ to be a function $h_Q$ such that

(a) $\mathrm{supp}(h_Q) \subset Q$;

(b) if $Q' \in \Delta$ and $Q' \subsetneq Q$, then $h_Q$ is constant on $Q'$;

(c) $\|h_Q\|_\infty \le |Q|^{-1/2}$;

(d) $\int_Q h_Q(x)\,dx = 0$.

Given an integer $r \ge 0$, a Haar shift operator of index $r$ is an operator of the form

\[ H_r f(x) = \sum_{Q \in \Delta} \; \sum_{\substack{Q', Q'' \in \Delta(Q) \\ 2^{-rn}|Q| \le |Q'|, |Q''|}} a_{Q',Q''} \langle f, h_{Q'} \rangle h_{Q''}(x), \]
where $a_{Q',Q''}$ is a constant such that

\[ |a_{Q',Q''}| \le C \left( \frac{|Q'|}{|Q|} \frac{|Q''|}{|Q|} \right)^{1/2}. \]

We say that $H_r$ is a CZ Haar shift operator if it is bounded on $L^2$.

An important example of a Haar shift operator when $n = 1$ is the Haar shift (also known as the dyadic Hilbert transform) $H^d$, defined by

\[ H^d f(x) = \sum_{I \in \Delta} \langle f, h_I \rangle \big( h_{I_-}(x) - h_{I_+}(x) \big), \]

where, as before, given a dyadic interval $I$, $I_+$ and $I_-$ are its right and left halves, and

\[ h_I(x) = |I|^{-1/2} \big( \chi_{I_-}(x) - \chi_{I_+}(x) \big). \]

Clearly $h_I$ is a Haar function on $I$ and one can write $H^d$ as a Haar shift operator of index $r = 1$ with $a_{I',I''} = \pm 1$ for $I' = I$, $I'' = I_\pm$, and $a_{I',I''} = 0$ otherwise. These are the operators used by Petermichl [46, 47] to approximate the Hilbert transform. More precisely, she used the family of operators $H^d_{s,t}$, $s \in \mathbb{R}$, $t > 0$, which are defined as above but with the dyadic grid replaced by its translation by $s$ and dilation by $t$. The Hilbert transform is then the limit of integral averages of these operators, so norm inequalities for $H$ follow from norm inequalities for $H^d_{s,t}$ by Fatou's lemma and Minkowski's inequality. Similar approximations hold for the Riesz transforms and Beurling-Ahlfors operator, and we refer the reader to [48, 49] for more details.
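To make the definition of the dyadic Hilbert transform concrete, here is a minimal numerical sketch. It is an illustration only, not part of the original argument: it restricts to step functions on $[0,1)$ sampled on $2^n$ equal cells, uses the sign convention $h_I = |I|^{-1/2}(\chi_{I_-} - \chi_{I_+})$, and sums only over dyadic intervals $I \subset [0,1)$ whose children still have halves on the grid. The function names `haar`, `inner` and `dyadic_shift` are hypothetical helpers introduced for this example.

```python
import math


def haar(level, index, n):
    """Samples of h_I on a grid of 2**n cells, where
    I = [index / 2**level, (index + 1) / 2**level) and
    h_I = |I|^(-1/2) * (chi_{I_-} - chi_{I_+}), with I_- the left half.
    Requires level <= n - 1 so that I has two halves on the grid."""
    size = 2 ** n
    width = size >> level            # number of grid cells inside I
    half = width // 2
    start = index * width
    amp = math.sqrt(2 ** level)      # |I|^(-1/2), since |I| = 2**(-level)
    h = [0.0] * size
    for c in range(start, start + half):
        h[c] = amp                   # left half I_-
    for c in range(start + half, start + width):
        h[c] = -amp                  # right half I_+
    return h


def inner(f, g, n):
    """L^2([0,1)) inner product of two step functions on the grid."""
    return sum(a * b for a, b in zip(f, g)) / 2 ** n


def dyadic_shift(f, n):
    """Truncated dyadic Hilbert transform:
    sum over I of <f, h_I> * (h_{I_-} - h_{I_+}),
    restricted to the levels 0..n-2 that the grid can resolve."""
    size = 2 ** n
    out = [0.0] * size
    for level in range(n - 1):
        for index in range(2 ** level):
            coeff = inner(f, haar(level, index, n), n)
            left = haar(level + 1, 2 * index, n)        # h_{I_-}
            right = haar(level + 1, 2 * index + 1, n)   # h_{I_+}
            for c in range(size):
                out[c] += coeff * (left[c] - right[c])
    return out
```

Because the Haar functions are orthonormal, the truncated operator satisfies $\|H^d f\|_{L^2}^2 = 2\sum_I |\langle f, h_I\rangle|^2 \le 2\|f\|_{L^2}^2$, which is a quick way to sanity-check the code; for example, applying `dyadic_shift` to `haar(0, 0, n)` should return the samples of $h_{[0,1/2)} - h_{[1/2,1)}$ up to rounding.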

To apply Theorem 3.3 to the Haar shift operators we need two lemmas. The first is simply that CZ Haar shift operators satisfy a weak $(1,1)$ inequality. The proof of this is known but we could not find it in the literature, except when $r = 0$: in this case $H_0$ is a constant Haar multiplier and the proof is given in [42]. Therefore, we provide a brief sketch of the details. Here and below we will use the following notation: given an integer $r \ge 0$ and a dyadic cube $Q$, let $Q^r$ denote its $r$-th generation "ancestor": that is, the unique dyadic cube $Q^r$ containing $Q$ such that $|Q^r| = 2^{rn}|Q|$.

Lemma 4.1. Given an integer $r \ge 0$, there exists a constant $C_{r,n}$ such that for every $t > 0$,

\[ |\{x \in \mathbb{R}^n : |H_r f(x)| > t\}| \le \frac{C_{r,n}}{t} \int_{\mathbb{R}^n} |f(x)|\,dx. \]

Proof. Fix $t > 0$ and form the Calderón-Zygmund decomposition of $f$ at height $t$. Decompose $f$ as the sum of the good and bad parts, $g + b$. The estimate for $g$ is standard. For $b$, since Lebesgue measure is doubling, it suffices to show that the set

\[ \{x \in \mathbb{R}^n \setminus (\textstyle\bigcup_j Q_j^r) : |H_r b(x)| > t/2\} \]

has measure 0. Fix $j$ and $x \in \mathbb{R}^n \setminus Q_j^r$; then we would be done if we could show that $H_r b_j(x) = 0$. Fix a term $a_{Q',Q''} \langle b_j, h_{Q'} \rangle h_{Q''}(x)$ in the sum defining $H_r b_j(x)$. If this is non-zero, then $h_{Q''}(x) \ne 0$, so $Q'' \cap (\mathbb{R}^n \setminus Q_j^r) \ne \emptyset$. Since $Q'' \subset Q$, $Q \cap (\mathbb{R}^n \setminus Q_j^r) \ne \emptyset$. On the other hand, since $\int_{Q_j} b_j(x)\,dx = 0$, $\langle b_j, h_{Q'} \rangle \ne 0$ only if $Q' \subset Q_j$, which in turn implies that $Q \subset Q_j^r$, a contradiction. □

Our second lemma is a key estimate that is a sharper variant of a 
result known for Calderon-Zygmund singular integrals (see [27]) and 
whose proof is similar. For completeness we include the details. 

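Lemma 4.2. Given $r \ge 0$, let $H_r$ be a CZ Haar shift operator. Fix $\lambda$, $0 < \lambda < 1$. Then for any function $f$, every dyadic cube $Q_0$, and every $x \in Q_0$,

\[ \omega_\lambda(H_r f, Q_0) \le \frac{C_{r,n}}{\lambda} \fint_{Q_0^r} |f(x)|\,dx, \]

\[ M^{\#,d}_{\lambda, Q_0}(H_r f)(x) \le \frac{C_{r,n}}{\lambda} M^d f(x). \]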

Proof. It suffices to prove the first inequality; the second follows immediately from the definition of $M^{\#,d}_{\lambda, Q_0}$. Fix $Q_0$ and write $H_r$ as the sum of two operators:

\[ H_r f(x) = H_r(f\chi_{Q_0^r})(x) + H_r(f\chi_{\mathbb{R}^n \setminus Q_0^r})(x). \]

We claim the second term is constant for all $x \in Q_0$. Let $Q$ be any dyadic cube. Then the corresponding term in the sum defining $H_r(f\chi_{\mathbb{R}^n \setminus Q_0^r})(x)$ is

\[ \sum_{\substack{Q', Q'' \in \Delta(Q) \\ 2^{-rn}|Q| \le |Q'|, |Q''|}} a_{Q',Q''} \langle f\chi_{\mathbb{R}^n \setminus Q_0^r}, h_{Q'} \rangle h_{Q''}(x). \tag{4.1} \]

We may assume that $Q'' \cap Q_0 \ne \emptyset$ (otherwise we get a zero term); since $Q'' \subset Q$, this implies that $Q \cap Q_0 \ne \emptyset$. Similarly, we have $Q \cap (\mathbb{R}^n \setminus Q_0^r) \ne \emptyset$. Therefore, $Q_0^r \subsetneq Q$, so $|Q_0| < 2^{-rn}|Q| \le |Q''|$. Hence, $Q_0 \subsetneq Q''$ and $h_{Q''}$ is constant on $Q_0$. Thus, (4.1) does not depend on $x$ and so is constant on $Q_0$.

Denote this constant by $H_r f(Q_0)$; then

\[ |\{x \in Q_0 : |H_r f(x) - H_r f(Q_0)| > t\}| = |\{x \in Q_0 : |H_r(f\chi_{Q_0^r})(x)| > t\}|. \]

Since $H_r$ is a CZ Haar shift operator it is weak $(1,1)$. Therefore, by inequality (3.1),

\[ \omega_\lambda(H_r f, Q_0) \le \big( (H_r f - H_r f(Q_0))\chi_{Q_0} \big)^*(\lambda |Q_0|) \le \lambda^{-1} \|H_r(f\chi_{Q_0^r})\|_{L^{1,\infty}(Q_0, |Q_0|^{-1}dx)} \le \frac{C_{r,n}}{\lambda} \fint_{Q_0^r} |f(x)|\,dx. \]

□
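The last inequality in the proof of Lemma 4.2 can be unpacked as follows. This is a sketch under the assumption that the normalized weak norm is defined by $\|g\|_{L^{1,\infty}(Q_0, |Q_0|^{-1}dx)} = \sup_{t>0} t\,|\{x \in Q_0 : |g(x)| > t\}|/|Q_0|$ (the definition is not restated in this section), and it shows where the factor $2^{rn}$ coming from $|Q_0^r| = 2^{rn}|Q_0|$ is absorbed into the constant.

```latex
% Weak (1,1) bound from Lemma 4.1 applied to f \chi_{Q_0^r}:
\[
\frac{t\,|\{x \in Q_0 : |H_r(f\chi_{Q_0^r})(x)| > t\}|}{|Q_0|}
\le \frac{C_{r,n}}{|Q_0|} \int_{Q_0^r} |f(x)|\,dx
= 2^{rn} C_{r,n} \,\frac{1}{|Q_0^r|}\int_{Q_0^r} |f(x)|\,dx .
\]
% Taking the supremum over t > 0 and multiplying by \lambda^{-1} gives the
% stated estimate, with 2^{rn} C_{r,n} renamed C_{r,n}.
```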

5. Singular integrals, paraproducts and Haar multipliers 

In this section we prove Theorems 1.1, 1.3, 1.6 and 1.7. The principal 
results are the first two for singular integrals; the results for paraprod- 
ucts and constant Haar multipliers are variations of these and we will 
only sketch the changes. We will also indicate the technical obstacles in 
attempting to apply our results to general Calderon-Zygmund singular 
integrals. 

One weight inequalities: proof of Theorem 1.1. As we discussed 
in the previous section, to prove Theorem 1.1 it will suffice to establish 
the analogous result for Haar shift operators. 

Theorem 5.1. Given an integer $r \ge 0$ and a CZ Haar shift operator $H_r$, and given $p$, $1 < p < \infty$, then for any $w \in A_p$,

\[ \|H_r f\|_{L^p(w)} \le C_{r,n,p} [w]_{A_p}^{\max(1, \frac{1}{p-1})} \|f\|_{L^p(w)}. \]

Proof of Theorem 5.1. By Theorem 2.2 it will suffice to prove that

\[ \|H_r f\|_{L^2(w)} \le C_{r,n} [w]_{A_2} \|f\|_{L^2(w)}. \]

Fix $w \in A_2$ and fix $f$. By a standard approximation argument we may assume without loss of generality that $f$ is bounded and has compact support. Let $\mathbb{R}^n_j$, $1 \le j \le 2^n$, denote the $n$-dimensional quadrants in $\mathbb{R}^n$: that is, the sets $I^\pm \times I^\pm \times \cdots \times I^\pm$ where $I^+ = [0, \infty)$ and $I^- = (-\infty, 0)$.

For each $j$, $1 \le j \le 2^n$, and for each $N > 0$ let $Q_{N,j}$ be the dyadic cube adjacent to the origin of side length $2^N$ that is contained in $\mathbb{R}^n_j$. Since $Q_{N,j} \in \Delta$, $\Delta(Q_{N,j}) \subset \Delta$. Because $H_r$ is a CZ shift operator, it is bounded on $L^2$. Thus, since $f \in L^2$, by (3.3) and (3.2), $m_{H_r f}(Q_{N,j}) \to 0$ as $N \to \infty$. Therefore, by Fatou's lemma and Minkowski's inequality,

\[ \|H_r f\|_{L^2(w)} \le \liminf_{N \to \infty} \sum_{j=1}^{2^n} \left( \int_{Q_{N,j}} |H_r f(x) - m_{H_r f}(Q_{N,j})|^2 w(x)\,dx \right)^{1/2}. \]

Hence, it will suffice to prove that each term in the sum on the right is bounded by $C_{r,n} [w]_{A_2} \|f\|_{L^2(w)}$.

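Fix $j$ and let $Q_N = Q_{N,j}$. By Theorem 3.3 and Lemma 4.2, for every $x \in Q_N$ we have that

\[
\begin{aligned}
|H_r f(x) - m_{H_r f}(Q_N)|
&\le 4 M^{\#,d}_{\frac14, Q_N}(H_r f)(x) + 4 \sum_{j,k} \omega_{\frac{1}{2^{n+2}}}(H_r f, \hat{Q}_j^k)\, \chi_{Q_j^k}(x) \\
&\le C_{r,n} M f(x) + C_{r,n} \sum_{j,k} \Big( \fint_{P_j^k} |f(x)|\,dx \Big) \chi_{Q_j^k}(x) \\
&= C_{r,n} M f(x) + C_{r,n} F(x),
\end{aligned}
\tag{5.1}
\]

where $P_j^k = (\hat{Q}_j^k)^r$. We get the desired estimate for the first term from Theorem 2.1 with $p = 2$:

\[ \|Mf\|_{L^2(Q_N, w)} \le \|Mf\|_{L^2(w)} \le C_n [w]_{A_2} \|f\|_{L^2(w)}. \]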

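22 DAVID CRUZ-URIBE, SFO, JOSE MARIA MARTELL, AND CARLOS PEREZ

To estimate $F$ we use duality. Fix a non-negative function $h \in L^2(w)$ with $\|h\|_{L^2(w)} = 1$; then by Remark 3.4 and Lemma 2.3 we have that

\[
\begin{aligned}
\int_{Q_N} F(x) h(x) w(x)\,dx
&= \sum_{j,k} \fint_{P_j^k} |f(x)|\,dx \int_{Q_j^k} h(x) w(x)\,dx \\
&\le C_{r,n} [w]_{A_2} \sum_{j,k} \int_{E_j^k} M^d_{w^{-1}}(fw)(x)\, M^d_w h(x)\,dx \\
&\le C_{r,n} [w]_{A_2} \int_{\mathbb{R}^n} M^d_{w^{-1}}(fw)(x)\, M^d_w h(x)\,dx \\
&\le C_{r,n} [w]_{A_2} \left( \int_{\mathbb{R}^n} M^d_{w^{-1}}(fw)(x)^2 w(x)^{-1}\,dx \right)^{1/2} \left( \int_{\mathbb{R}^n} M^d_w h(x)^2 w(x)\,dx \right)^{1/2} \\
&\le C_{r,n} [w]_{A_2} \left( \int_{\mathbb{R}^n} |f(x) w(x)|^2 w(x)^{-1}\,dx \right)^{1/2} \left( \int_{\mathbb{R}^n} h(x)^2 w(x)\,dx \right)^{1/2} \\
&= C_{r,n} [w]_{A_2} \left( \int_{\mathbb{R}^n} |f(x)|^2 w(x)\,dx \right)^{1/2}.
\end{aligned}
\]

If we take the supremum over all such functions we conclude that

\[ \|F\|_{L^2(Q_N, w)} \le C_{r,n} [w]_{A_2} \|f\|_{L^2(w)}. \]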

Combining our estimates we have that

\[ \left( \int_{Q_N} |H_r f(x) - m_{H_r f}(Q_N)|^2 w(x)\,dx \right)^{1/2} \le C_{r,n} [w]_{A_2} \|f\|_{L^2(w)}, \]

and this completes the proof. □
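The step in the duality argument where the $A_2$ constant enters can be written out term by term. The following is a sketch, not a quotation of the original proof. It assumes the standard definitions $[w]_{A_2} = \sup_Q \frac{w(Q)}{|Q|}\frac{w^{-1}(Q)}{|Q|}$ and $M^d_\sigma g(x) = \sup_{x \in Q \in \Delta} \frac{1}{\sigma(Q)}\int_Q |g|\,\sigma\,dx$, writes $P = P_j^k$, $Q = Q_j^k$, $E = E_j^k$, and uses $E \subset Q \subset P$ together with $|P| = 2^{(r+1)n}|Q| \le 2^{(r+1)n+1}|E|$ (Remark 3.4).

```latex
\[
\begin{aligned}
\frac{1}{|P|}\int_{P} |f|\,dx \int_{Q} h\,w\,dx
&= \frac{w(P)}{|P|}\,\frac{w^{-1}(P)}{|P|}\;|P|\;
   \Big(\frac{1}{w^{-1}(P)}\int_{P} |f|\,w \cdot w^{-1}\,dx\Big)
   \Big(\frac{1}{w(P)}\int_{Q} h\,w\,dx\Big) \\
&\le [w]_{A_2}\; 2^{(r+1)n+1}\,|E|\;
   \inf_{x\in E} M^d_{w^{-1}}(fw)(x)\;\inf_{x \in E} M^d_{w}h(x) \\
&\le 2^{(r+1)n+1}\,[w]_{A_2} \int_{E} M^d_{w^{-1}}(fw)(x)\, M^d_{w}h(x)\,dx .
\end{aligned}
\]
% Summing over j and k and using that the sets E_j^k are pairwise disjoint
% gives the integral over R^n of the product of the two maximal functions.
```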

Two weight inequalities: Proof of Theorem 1.3. To prove Theo- 
rem 1.3 it will suffice to establish the corresponding result for the Haar 
shift operators. We record this separate result. 

Theorem 5.2. Given an integer $r \ge 0$, let $H_r$ be a CZ Haar shift operator. Given $p$, $1 < p < \infty$, let $A$ and $B$ be Young functions such that $\bar{A} \in B_{p'}$ and $\bar{B} \in B_p$. Then for any pair $(u, v)$ such that (1.2) holds we have that

\[ \|H_r f\|_{L^p(u)} \le C \|f\|_{L^p(v)}. \]

Proof of Theorem 5.2. The proof is very similar to the proof of Theorem 5.1, replacing the $A_2$ estimates with an argument from the second half of the proof of the main theorem in [5]; therefore, we omit many of the details.

We argue as in the one-weight case and with the same notation; it will suffice to prove

\[ \left( \int_{Q_N} |H_r f(x) - m_{H_r f}(Q_N)|^p u(x)\,dx \right)^{1/p} \le C \|f\|_{L^p(v)} \]

and we use (5.1). The estimate of the term containing the maximal operator is straightforward: by Theorem 2.5 we have $M : L^p(v) \to L^p(u)$ since the pair $(u, v)$ satisfies (1.2). Therefore, by duality (this time with respect to Lebesgue measure) it is enough to show that for every non-negative $h \in L^{p'}(\mathbb{R}^n)$ with $\|h\|_{L^{p'}} = 1$,

\[ \int_{Q_N} F(x) u(x)^{1/p} h(x)\,dx \le C \|f\|_{L^p(v)}. \]
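We apply Remark 3.4 and the generalized Hölder's inequality to get

\[
\begin{aligned}
\int_{Q_N} F(x) u(x)^{1/p} h(x)\,dx
&\le C \sum_{j,k} \fint_{P_j^k} |f(x)|\,dx \fint_{Q_j^k} u(x)^{1/p} h(x)\,dx \; |E_j^k| \\
&\le C \sum_{j,k} \|f v^{1/p}\|_{\bar{B}, P_j^k} \|v^{-1/p}\|_{B, P_j^k} \|u^{1/p}\|_{A, Q_j^k} \|h\|_{\bar{A}, Q_j^k} |E_j^k|.
\end{aligned}
\]

By convexity, $\|u^{1/p}\|_{A, Q_j^k} \le 2^{n(r+1)} \|u^{1/p}\|_{A, P_j^k}$, so since the pair $(u, v)$ satisfies (1.2),

\[
\begin{aligned}
\int_{Q_N} F(x) u(x)^{1/p} h(x)\,dx
&\le C \sum_{j,k} \int_{E_j^k} M_{\bar{B}}(f v^{1/p})(x)\, M_{\bar{A}} h(x)\,dx \\
&\le C \int_{\mathbb{R}^n} M_{\bar{B}}(f v^{1/p})(x)\, M_{\bar{A}} h(x)\,dx.
\end{aligned}
\]

Since $\bar{A} \in B_{p'}$ and $\bar{B} \in B_p$, by Theorem 2.4, $M_{\bar{B}}$ is bounded on $L^p$ and $M_{\bar{A}}$ is bounded on $L^{p'}$. The desired estimate now follows by Hölder's inequality. □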

General Calderón-Zygmund singular integrals. Key to the proofs of Theorems 5.1 and 5.2 are the sharp estimates for the local mean oscillation in Lemma 4.2. If we were to try to extend these proofs to an arbitrary Calderón-Zygmund singular integral $T$, then we would have to estimate the local mean oscillation by a sum (see [27]):

\[ \omega_\lambda(Tf, Q) \le C \sum_{i=0}^{\infty} 2^{-i} \fint_{2^i Q} |f(x)|\,dx. \]

If we use this estimate in the proof of Theorem 5.1, then we still get that $T$ is bounded (since the sum is bounded by $2 \inf_{x \in Q} Mf(x)$), but we get an additional factor of $[w]_{A_2}$. The proof of Theorem 5.2 can be modified to handle this sum, but to get convergence you need the additional assumption that $p > n$. This is the approach used by Lerner [34]. This (seemingly artificial) restriction $p > n$ also appears in [5]. It would be very interesting to find a refinement of Theorem 3.3
that would let us remove this restriction. Alternatively, it is tempting to conjecture that the estimate above could be improved by replacing $2^{-i}$ by $2^{-(n+\epsilon)i}$, which would be sufficient to adapt the proofs in both the one and two-weight case. However, it is not clear that such an inequality is true, even for singular integrals with smooth kernels.

Dyadic paraproducts and Haar multipliers: Proof of Theo- 
rems 1.6 and 1.7. The proof of Theorem 1.6 is essentially identical 
to the proof of the corresponding results for singular integrals once we 
prove the analog of Lemma 4.2: 



The proof follows as before. The dyadic paraproduct is a local operator, since for any $I \in \Delta$, $h_I$ is constant on proper dyadic sub-intervals of $I$, and so, given a fixed dyadic interval $I_0$, $\pi_b(f\chi_{\mathbb{R} \setminus I_0})$ is constant on $I_0$. Furthermore, $\pi_b$ is bounded on $L^2$ and satisfies a weak $(1,1)$ inequality: for every $t > 0$,



For a proof, see Pereyra [42]. 

Theorem 1.7 is actually a special case of Theorems 5.1 and 5.2, since the constant Haar multipliers are clearly Haar shift operators of index $r = 0$. The dependence on $\|\alpha\|_{\ell^\infty}$ follows at once by linearity.



6. Maximal singular integrals

In this section we prove Theorem 1.4. To do so, we will follow the approach used by Hytönen et al. [23] and actually prove the corresponding result for a family of "maximal" dyadic shift operators. The underlying dyadic operators are a generalization of the Haar shift operators defined in Section 4. As noted in [23], the results for the maximal singular integrals associated to the Hilbert transform, the Riesz transforms and the Beurling-Ahlfors operator are obtained by the same approximation arguments as we discussed above.

We begin by defining the appropriate shift operators. To distinguish them from the operators defined above, we will refer to them as generalized Haar shift operators. (In [23] they are simply referred to as Haar shift operators, but our change in terminology should not cause any confusion.) We say that an operator $T$ is a generalized Haar shift operator of index $r \ge 0$ if

\[ Tf = \sum_{Q \in \Delta} \langle f, g_Q \rangle \gamma_Q, \]

where the functions $\gamma_Q$ are such that:

(a) $\mathrm{supp}(\gamma_Q) \subset Q$;

(b) if $Q' \in \Delta$ and $Q' \subset Q$ with $|Q'| < 2^{-rn}|Q|$, then $\gamma_Q$ is constant on $Q'$;

(c) $\|\gamma_Q\|_\infty \le |Q|^{-1/2}$.

The functions $g_Q$ also have these properties. Finally we assume that the functions $\gamma_Q$, $g_Q$ are such that $T$ extends to a bounded operator on $L^2$. Together, these hypotheses imply that $T$ is of weak-type $(1,1)$ (see [23]). Examples of generalized Haar shift operators include the dyadic paraproducts and their adjoints.

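Associated with a generalized Haar shift operator $T$ is the maximal Haar shift operator

\[ T_* f = \sup_{\epsilon > 0} |T_\epsilon f| = \sup_{\epsilon > 0} \Big| \sum_{\substack{Q \in \Delta \\ |Q| > \epsilon^n}} \langle f, g_Q \rangle \gamma_Q \Big|. \]

We again have that $T_*$ is bounded on $L^2$ and is of weak-type $(1,1)$ (see [23]).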

Our main result for maximal Haar shift operators is the following. 

Theorem 6.1. Let $T$ be a generalized Haar shift operator of index $r \ge 0$, and let $T_*$ be the corresponding maximal Haar shift operator. Then, for every $p$, $1 < p < \infty$, and for all $w \in A_p$,

\[ \|T_* f\|_{L^p(w)} \le C_{r,n,p} [w]_{A_p}^{\max(1, \frac{1}{p-1})} \|f\|_{L^p(w)}. \]

Furthermore, if the pair of weights $(u, v)$ satisfies (1.2), then

\[ \|T_* f\|_{L^p(u)} \le C \|f\|_{L^p(v)}. \]

The proof of Theorem 6.1 is very much the same as the proofs of Theorems 5.1 and 5.2, so we will only describe the differences between the two arguments. First, if $f$ is bounded and has compact support, $\sup_{\epsilon > 0} |m_{T_\epsilon f}(Q)| \to 0$ as $|Q| \to \infty$. Indeed, by (3.3) and (3.2),

\[ \sup_{\epsilon > 0} |m_{T_\epsilon f}(Q)| \le 2^{1/2} \sup_{\epsilon > 0} \left( \fint_Q |T_\epsilon f|^2\,dx \right)^{1/2} \le 2^{1/2} |Q|^{-1/2} \|T_* f\|_{L^2}, \]

and the right-hand term tends to 0 as $|Q| \to \infty$ since $T_*$ is bounded on $L^2$. Now fix $j$, $1 \le j \le 2^n$, and as before let $Q_N = Q_{N,j}$. Then in the one-weight case by Fatou's lemma we have that



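\[ \int_{\mathbb{R}^n_j} |T_* f(x)|^2 w(x)\,dx \le \liminf_{N \to \infty} \int_{Q_N} \sup_{\epsilon > 0} |T_\epsilon f(x) - m_{T_\epsilon f}(Q_N)|^2 w(x)\,dx. \]

In the two-weight case we get the same inequality with 2 replaced by $p$ and $w$ with the weight $u$.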

Fix $\epsilon > 0$ and apply Theorem 3.3. To continue the proof we need an analog of Lemma 4.2 that takes into account the supremum. This in turn reduces to showing the following: given $\lambda$, $0 < \lambda < 1$, and a dyadic cube $Q_0$, for every $x \in Q_0$

\[ \sup_{\epsilon > 0} \omega_\lambda(T_\epsilon f, Q_0) \le \frac{C}{\lambda} \fint_{Q_0^r} |f(x)|\,dx. \tag{6.1} \]

Given inequality (6.1) the remainder of the proof in both the one and two-weight case proceeds exactly as before.

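To prove (6.1) we proceed as in the proof of Lemma 4.2. Fix $\epsilon > 0$; since $T_\epsilon$ is linear,

\[ T_\epsilon f(x) = T_\epsilon(f\chi_{Q_0^r})(x) + T_\epsilon(f\chi_{\mathbb{R}^n \setminus Q_0^r})(x). \]

We claim that the second term is constant for all $x \in Q_0$; denote it by $T_\epsilon f(Q_0)$. Assuming this for the moment, we have that

\[
\begin{aligned}
|\{x \in Q_0 : |T_\epsilon f(x) - T_\epsilon f(Q_0)| > t\}|
&= |\{x \in Q_0 : |T_\epsilon(f\chi_{Q_0^r})(x)| > t\}| \\
&\le |\{x \in Q_0 : T_*(f\chi_{Q_0^r})(x) > t\}| \\
&\le \frac{C}{t} \int_{Q_0^r} |f(x)|\,dx,
\end{aligned}
\]

where we have used that $T_*$ is of weak-type $(1,1)$. Inequality (6.1) follows at once from this and (3.1):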

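\[ \sup_{\epsilon > 0} \omega_\lambda(T_\epsilon f, Q_0) \le \sup_{\epsilon > 0} \big( (T_\epsilon f - T_\epsilon f(Q_0))\chi_{Q_0} \big)^*(\lambda |Q_0|) \le \frac{C}{\lambda} \fint_{Q_0^r} |f(x)|\,dx. \]

It remains to show that $T_\epsilon f(Q_0)$ is indeed a constant. Fix $x \in Q_0$; then

\[ T_\epsilon(f\chi_{\mathbb{R}^n \setminus Q_0^r})(x) = \sum_{\substack{Q \in \Delta \\ |Q| > \epsilon^n}} \langle f\chi_{\mathbb{R}^n \setminus Q_0^r}, g_Q \rangle \gamma_Q(x). \]

We may restrict the sum to those cubes $Q$ satisfying $Q \cap Q_0 \ne \emptyset$ and $Q \cap (\mathbb{R}^n \setminus Q_0^r) \ne \emptyset$ since otherwise we get terms equal to 0. In this case, $Q_0^r \subsetneq Q$, and consequently $Q_0 \subset Q$ with $|Q_0| < 2^{-rn}|Q|$. This implies that $\gamma_Q$ is constant on $Q_0$, which proves our claim.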

7. The dyadic square function 

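To prove our results for the dyadic square function we must first give a version of Lemma 4.2. The key change, however, is that we prove it not for $S_d f$ but for $(S_d f)^2$.

Lemma 7.1. Fix $\lambda$, $0 < \lambda < 1$. Then for any function $f$, every dyadic cube $Q_0$, and every $x \in Q_0$,

\[ \omega_\lambda((S_d f)^2, Q_0) \le \frac{C_n}{\lambda^2} \left( \fint_{Q_0} |f(x)|\,dx \right)^2, \]

\[ M^{\#,d}_{\lambda, Q_0}((S_d f)^2)(x) \le \frac{C_n}{\lambda^2} M^d f(x)^2. \]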

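Proof. It suffices to prove the first inequality; the second follows immediately from the definition of $M^{\#,d}_{\lambda, Q_0}$. Fix $Q_0$; then for every $x \in Q_0$ we can decompose $S_d f(x)^2$ as

\[ S_d f(x)^2 = \sum_{\substack{Q \in \Delta \\ Q \subsetneq Q_0}} |f_Q - f_{\hat{Q}}|^2 \chi_Q(x) + \sum_{\substack{Q \in \Delta \\ Q \supseteq Q_0}} |f_Q - f_{\hat{Q}}|^2. \]

The second term is a constant; denote it by $S_d f(Q_0)^2$. Furthermore, we have that for $x \in Q_0$,

\[ 0 \le S_d f(x)^2 - S_d f(Q_0)^2 = \sum_{\substack{Q \in \Delta \\ Q \subsetneq Q_0}} |f_Q - f_{\hat{Q}}|^2 \chi_Q(x) \le S_d(f\chi_{Q_0})(x)^2. \]

Hence, since $S_d$ is weak $(1,1)$ (see, for instance, [55]), for every $t > 0$ we have that

\[ |\{x \in Q_0 : |S_d f(x)^2 - S_d f(Q_0)^2| > t\}| \le |\{x \in Q_0 : S_d(f\chi_{Q_0})(x) > t^{1/2}\}| \le \frac{C_n}{t^{1/2}} \int_{Q_0} |f(x)|\,dx. \]

Therefore, by (3.1) with $p = 1/2$,

\[ \omega_\lambda((S_d f)^2, Q_0) \le \big( ((S_d f)^2 - S_d f(Q_0)^2)\chi_{Q_0} \big)^*(\lambda |Q_0|) \le \frac{C_n}{\lambda^2} \left( \fint_{Q_0} |f(x)|\,dx \right)^2. \]

□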

One weight inequalities: Proof of Theorem 1.8. The proof is a variation of the proof of Theorem 5.1 and we describe the main changes. The first is that rather than proving this result for $p = 2$, we choose $p = 3$, so that $1/2 = (p - 1)^{-1}$. Then by Theorem 2.2 it will suffice to prove that for any $w \in A_3$,

\[ \|S_d f\|_{L^3(w)} \le C_n [w]_{A_3}^{1/2} \|f\|_{L^3(w)}. \]
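The choice $p = 3$ can be checked against the target exponent. The computation below is a sketch that assumes Theorem 2.2 is the sharp extrapolation theorem in its usual form: a bound on $L^{p_0}(w)$ with constant $[w]_{A_{p_0}}^{\alpha}$ yields a bound on $L^p(w)$ with constant $[w]_{A_p}^{\alpha \max(1, \frac{p_0 - 1}{p - 1})}$. The statement of Theorem 2.2 is not reproduced in this section, so this form is an assumption.

```latex
% Starting point: p_0 = 3, \alpha = 1/2.
\[
\alpha \max\Big(1, \frac{p_0 - 1}{p - 1}\Big)
= \frac12 \max\Big(1, \frac{2}{p - 1}\Big)
= \max\Big(\frac12, \frac{1}{p - 1}\Big),
\qquad 1 < p < \infty .
\]
% For p >= 3 the exponent is 1/2, for 1 < p <= 3 it is 1/(p-1); the two
% branches agree exactly at p = 3, which is why this endpoint is used.
```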

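Fix $w \in A_3$ and $j$, $1 \le j \le 2^n$. As before, let $Q_N = Q_{N,j}$. Then,

\[ \left( \int_{\mathbb{R}^n_j} |S_d f(x)|^3 w(x)\,dx \right)^{2/3} \le \liminf_{N \to \infty} \left( \int_{Q_N} |S_d f(x)^2 - m_{(S_d f)^2}(Q_N)|^{3/2} w(x)\,dx \right)^{2/3}. \]

By Theorem 3.3 and Lemma 7.1, for every $x \in Q_N$ we have

\[
\begin{aligned}
|S_d f(x)^2 - m_{(S_d f)^2}(Q_N)|
&\le C_n M f(x)^2 + C_n \sum_{j,k} \Big( \fint_{\hat{Q}_j^k} |f(x)|\,dx \Big)^2 \chi_{Q_j^k}(x) \\
&= C_n M f(x)^2 + C_n F(x).
\end{aligned}
\tag{7.1}
\]

To estimate the first term we use Theorem 2.1 with $p = 3$:

\[ \|(Mf)^2\|_{L^{3/2}(Q_N, w)} \le \|Mf\|_{L^3(w)}^2 \le C_n [w]_{A_3} \|f\|_{L^3(w)}^2. \]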

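To estimate $F$ we use duality. Fix a non-negative function $h \in L^3(w)$ with $\|h\|_{L^3(w)} = 1$; then Remark 3.4 and Lemma 2.3 yield

\[
\begin{aligned}
\int_{Q_N} F(x) h(x) w(x)\,dx
&= \sum_{j,k} \Big( \fint_{\hat{Q}_j^k} |f(x)|\,dx \Big)^2 \int_{Q_j^k} w(x) h(x)\,dx \\
&\le C_n \sum_{j,k} \frac{w(\hat{Q}_j^k)}{|\hat{Q}_j^k|} \Big( \frac{w^{-1/2}(\hat{Q}_j^k)}{|\hat{Q}_j^k|} \Big)^2 |E_j^k| \Big( \frac{1}{w^{-1/2}(\hat{Q}_j^k)} \int_{\hat{Q}_j^k} |f(x)| w(x)^{1/2} w(x)^{-1/2}\,dx \Big)^2 \\
&\qquad \times \frac{1}{w(\hat{Q}_j^k)} \int_{Q_j^k} h(x) w(x)\,dx \\
&\le C_n [w]_{A_3} \sum_{j,k} \int_{E_j^k} M^d_{w^{-1/2}}(f w^{1/2})(x)^2 M^d_w h(x)\,dx \\
&\le C_n [w]_{A_3} \int_{\mathbb{R}^n} M^d_{w^{-1/2}}(f w^{1/2})(x)^2 M^d_w h(x)\,dx \\
&\le C_n [w]_{A_3} \left( \int_{\mathbb{R}^n} M^d_{w^{-1/2}}(f w^{1/2})(x)^3 w(x)^{-1/2}\,dx \right)^{2/3} \left( \int_{\mathbb{R}^n} M^d_w h(x)^3 w(x)\,dx \right)^{1/3} \\
&\le C_n [w]_{A_3} \left( \int_{\mathbb{R}^n} |f(x) w(x)^{1/2}|^3 w(x)^{-1/2}\,dx \right)^{2/3} \left( \int_{\mathbb{R}^n} h(x)^3 w(x)\,dx \right)^{1/3} \\
&= C_n [w]_{A_3} \|f\|_{L^3(w)}^2.
\end{aligned}
\]

Taking the supremum over all such functions $h$ we conclude that

\[ \|F\|_{L^{3/2}(Q_N, w)} \le C_n [w]_{A_3} \|f\|_{L^3(w)}^2. \]

If we combine the two estimates we get

\[ \left( \int_{Q_N} |S_d f(x)^2 - m_{(S_d f)^2}(Q_N)|^{3/2} w(x)\,dx \right)^{1/3} \le C_n [w]_{A_3}^{1/2} \|f\|_{L^3(w)}, \]

and the desired inequality follows as before.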

The exponent $\max(\frac{1}{2}, \frac{1}{p-1})$ is the best possible. As we noted above, for $p \le 2$ specific examples were constructed by Dragičević et al. [12]. For $p > 2$, a proof was sketched by Lerner [31], adapting an argument for singular integrals due to R. Fefferman and Pipher [15]. For completeness we give the details.

If $2 < p < 3$, then the sharpness of this exponent follows at once by extrapolation. For suppose there existed $p_0$ in this range such that the best possible exponent satisfied $\alpha(p_0) < \frac{1}{p_0 - 1}$. Then by Theorem 2.2, we get that the exponent in the weighted $L^2$ inequality is

\[ \alpha(p_0) \max\Big(1, \frac{p_0 - 1}{2 - 1}\Big) < 1, \]

contradicting the fact that the best possible exponent is 1.

We now consider the case $p \ge 3$. Suppose to the contrary that there exists a non-decreasing function $\phi$ such that $\phi(t)/t^{1/2} \to 0$ as $t \to \infty$, and suppose that for some $p_0 > 2$,

\[ \|S_d f\|_{L^{p_0}(w)} \le C_{n,p_0}\, \phi([w]_{A_{p_0}}) \|f\|_{L^{p_0}(w)}. \tag{7.2} \]

We will show that this implies for all $p > p_0$ that

\[ \|S_d f\|_{L^p} \le C_1\, \phi(C_2 p) \|f\|_{L^p}. \tag{7.3} \]

Below we will give an example to show that this is a contradiction.

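To prove (7.3), fix $p > p_0$ and fix a non-negative function $h \in L^{(p/p_0)'}$, $\|h\|_{L^{(p/p_0)'}(\mathbb{R}^n)} = 1$. Define the Rubio de Francia iteration algorithm (see [7])

\[ Rh = \sum_{k=0}^{\infty} \frac{M^k h}{2^k \|M\|^k_{L^{(p/p_0)'}(\mathbb{R}^n)}}. \]

Then it follows from this definition that $h \le Rh$, $\|Rh\|_{L^{(p/p_0)'}(\mathbb{R}^n)} \le 2$ and

\[ [Rh]_{A_{p_0}} \le [Rh]_{A_1} \le 2 \|M\|_{L^{(p/p_0)'}(\mathbb{R}^n)} \le C_{n,p_0}\, p. \]

Therefore, by (7.2) and Hölder's inequality,

\[
\begin{aligned}
\int_{\mathbb{R}^n} S_d f(x)^{p_0} h(x)\,dx
&\le \int_{\mathbb{R}^n} S_d f(x)^{p_0} Rh(x)\,dx \\
&\le C_{n,p_0}^{p_0}\, \phi([Rh]_{A_{p_0}})^{p_0} \int_{\mathbb{R}^n} |f(x)|^{p_0} Rh(x)\,dx \\
&\le C_{n,p_0}\, \phi(C_{n,p_0}\, p)^{p_0} \|f\|_{L^p(\mathbb{R}^n)}^{p_0}.
\end{aligned}
\]

Inequality (7.3) now follows by duality, giving us the desired contradiction.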
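The three properties of the Rubio de Francia iterate used above follow from the definition in a few lines. The sketch below writes $s = (p/p_0)'$ and $\|M\|$ for the operator norm of the Hardy-Littlewood maximal operator on $L^s(\mathbb{R}^n)$, with $M^0 h = h$. The final bound $\|M\| \le C_n s'$ is the standard estimate for the norm of $M$ on $L^s$ and is quoted here as an assumption.

```latex
% (i) The k = 0 term of the series is h itself, so h <= Rh.
% (ii) Norm bound, using \|M^k h\|_{L^s} \le \|M\|^k \|h\|_{L^s}:
\[
\|Rh\|_{L^s} \le \sum_{k=0}^{\infty} 2^{-k}\,\|h\|_{L^s} = 2 .
\]
% (iii) A_1 bound, using the sublinearity of M:
\[
M(Rh) \le \sum_{k=0}^{\infty} \frac{M^{k+1}h}{2^{k}\|M\|^{k}}
      = 2\|M\| \sum_{k=0}^{\infty} \frac{M^{k+1}h}{2^{k+1}\|M\|^{k+1}}
      \le 2\|M\|\,Rh ,
\]
% so [Rh]_{A_1} \le 2\|M\|. Since s' = p/p_0,
\[
2\|M\|_{L^{s}(\mathbb{R}^n)} \le C_n\, s' = C_n\,\frac{p}{p_0} .
\]
```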

It remains to show that (7.3) cannot hold. This result is known: see, for instance, Wang [54]. For completeness, here we construct a simple example of a function $f$ on the real line such that $\|S_d f\|_p \ge c\, p^{1/2} \|f\|_p$.

Define the function $f$ on $\mathbb{R}$ by

\[ f(x) = \sum_{j=0}^{\infty} \chi_{(2^{-2j-1},\, 2^{-2j})}(x). \]

Then

\[ \|f\|_p = \Big( \sum_{j=0}^{\infty} 2^{-2j-1} \Big)^{1/p} \le 1. \]

To estimate the norm of $S_d f$, let $F_i = f_{Q_i}$, $i \ge 1$, denote the average of $f$ on the interval $Q_i = [0, 2^{-i})$. Then repeating the above calculation shows that

\[ F_{2i} = 2^{2i} \sum_{j=i}^{\infty} 2^{-2j-1} = \frac{2}{3}. \]

Since the integrals of $f$ on $Q_{2i}$ and $Q_{2i-1}$ are the same, $F_{2i-1} = \frac{1}{3}$. Therefore, given $i \ge 2$, if $2^{-2i-1} < x < 2^{-2i}$,

\[ S_d f(x)^2 \ge \sum_{1 \le j \le i} |F_{2j} - F_{2j-1}|^2 = \frac{i}{9} \ge c \log(1/x). \]

The same estimate (with a smaller constant $c$) holds when $2^{-2i} < x < 2^{-2i+1}$. Therefore,



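\[ \|S_d f\|_p \ge c \left( \int_0^{e^{-3}} \log(1/x)^{p/2}\,dx \right)^{1/p} \ge c \Big( \sum_{k=3}^{\infty} k^{p/2} e^{-k} \Big)^{1/p} \ge c\, p^{1/2}, \]

where to get the last estimate we drop all the terms in the sum except for $k = [p] + 2$. Combining these two estimates, we see that

\[ \|S_d f\|_p \ge c\, p^{1/2} \|f\|_p, \]

which is what we wanted to prove.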
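The averages that drive this example are easy to verify by machine. The following sketch uses exact rational arithmetic to check that the average of $f = \sum_{j \ge 0} \chi_{(2^{-2j-1}, 2^{-2j})}$ on $Q_i = [0, 2^{-i})$ is $2/3$ for even $i$ and $1/3$ for odd $i$, so that the first $i$ even-to-odd jumps contribute $i/9$ to $S_d f(x)^2$. The series is truncated at a finite number of terms (the parameter `terms` is a hypothetical cutoff introduced only for the computation), so the values agree with the exact ones up to a negligible tail.

```python
from fractions import Fraction


def integral_on_prefix(i, terms=60):
    """Integral of f over [0, 2**-i), where f is the sum over j < terms of
    the indicator of the interval (2**-(2j+1), 2**-(2j))."""
    right = Fraction(1, 2 ** i)
    total = Fraction(0)
    for j in range(terms):
        lo = Fraction(1, 2 ** (2 * j + 1))
        hi = min(Fraction(1, 2 ** (2 * j)), right)
        if hi > lo:                  # part of the interval lies in [0, 2**-i)
            total += hi - lo
    return total


def average(i, terms=60):
    """Average F_i of the truncated f on Q_i = [0, 2**-i)."""
    return integral_on_prefix(i, terms) * 2 ** i


def square_function_lower_bound(i, terms=60):
    """Sum over 1 <= j <= i of |F_{2j} - F_{2j-1}|**2, a lower bound for
    S_d f(x)**2 when 2**-(2i+1) < x < 2**-(2i)."""
    return sum((average(2 * j, terms) - average(2 * j - 1, terms)) ** 2
               for j in range(1, i + 1))


if __name__ == "__main__":
    for i in range(1, 5):
        print(i, float(average(2 * i)), float(average(2 * i - 1)),
              float(square_function_lower_bound(i)))
```

Since $\log(1/x) \le (2i + 1)\log 2$ on the interval in question, the lower bound $i/9$ is comparable to $\log(1/x)$, which is the logarithmic growth used in the final estimate.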



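Two weight inequalities: Proof of Theorem 1.10. Fix $p$, $1 < p < \infty$. Then, arguing as before it suffices to show that

\[ \int_{Q_N} |S_d f(x)^2 - m_{(S_d f)^2}(Q_N)|^{p/2} u(x)\,dx \le C \int_{\mathbb{R}^n} |f(x)|^p v(x)\,dx. \]

We again use (7.1). To estimate the term containing $M$, note that we have

\[ \|(Mf)^2\|_{L^{p/2}(Q_N, u)} \le \|Mf\|_{L^p(u)}^2 \le C \|f\|_{L^p(v)}^2, \]

where we have used Theorem 2.5 and the fact that $(u, v)$ satisfies (1.5) when $1 < p \le 2$ or (1.7) when $p > 2$.

To estimate $F$ we consider two cases. Suppose first that $1 < p \le 2$. Then we use that $p/2 \le 1$, inequality (1.5), and Theorem 2.4 and the fact that $\bar{B} \in B_p$ to get

\[
\begin{aligned}
\int_{Q_N} F(x)^{p/2} u(x)\,dx
&\le \sum_{j,k} \Big( \fint_{\hat{Q}_j^k} |f(x)|\,dx \Big)^p u(Q_j^k) \\
&\le C \sum_{j,k} \fint_{\hat{Q}_j^k} u(x)\,dx\; \|v^{-1/p}\|_{B, \hat{Q}_j^k}^p \|f v^{1/p}\|_{\bar{B}, \hat{Q}_j^k}^p |E_j^k| \\
&\le C \sum_{j,k} \int_{E_j^k} M_{\bar{B}}(f v^{1/p})(x)^p\,dx \\
&\le C \int_{\mathbb{R}^n} M_{\bar{B}}(f v^{1/p})(x)^p\,dx \\
&\le C \int_{\mathbb{R}^n} |f(x)|^p v(x)\,dx.
\end{aligned}
\]

Combining these two estimates we get the desired inequality.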

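Now suppose that $p > 2$. In this case the proof is very similar to the proof of Theorem 5.2 and we highlight the changes. To use duality with respect to Lebesgue measure, fix a non-negative function $h \in L^{(p/2)'}$ with $\|h\|_{L^{(p/2)'}} = 1$. Then (1.7) gives

\[
\begin{aligned}
\int_{Q_N} F(x) u(x)^{2/p} h(x)\,dx
&\le C \sum_{j,k} \Big( \fint_{\hat{Q}_j^k} |f(x)|\,dx \Big)^2 \fint_{Q_j^k} u(x)^{2/p} h(x)\,dx\; |E_j^k| \\
&\le C \sum_{j,k} \int_{E_j^k} M_{\bar{B}}(f v^{1/p})(x)^2 M_{\bar{A}} h(x)\,dx \\
&\le C \int_{\mathbb{R}^n} M_{\bar{B}}(f v^{1/p})(x)^2 M_{\bar{A}} h(x)\,dx \\
&\le C \|f\|_{L^p(v)}^2,
\end{aligned}
\]

where we have used Hölder's inequality, Theorem 2.4 and the fact that $\bar{A} \in B_{(p/2)'}$ and $\bar{B} \in B_p$. The desired estimate follows at once if we take the supremum over all such functions $h$.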



8. The vector-valued maximal operator

Our two results for the vector-valued maximal operator are exact parallels of our results for the dyadic square function. Formally, the change only requires replacing "2" by "$q$", $1 < q < \infty$, and in fact, the proofs do adapt readily as we will sketch below.

As with singular integral operators, in order to prove sharp results for the vector-valued maximal operator, we need to consider a dyadic operator. Recall that the dyadic maximal operator is defined by

\[ M^d f(x) = \sup_{x \in Q \in \Delta} \fint_Q |f(y)|\,dy. \]

Given $q > 1$ and $f = \{f_i\}$, define the dyadic vector-valued maximal operator by

\[ M^d_q f(x) = \Big( \sum_{i=1}^{\infty} M^d f_i(x)^q \Big)^{1/q}. \]

By an argument that goes back to C. Fefferman and Stein [14] (see also [50] and [18]), the maximal operator can be approximated by the dyadic maximal operator and the analogous operator defined on all translates of the dyadic grid. Therefore, by a straightforward argument using Fatou's lemma and Minkowski's inequality, to prove weighted norm inequalities for the vector-valued maximal operator it suffices to prove them for $M^d_q$. (For the details of this argument, see [7].)

Again like the dyadic square function, the key estimate for the dyadic vector-valued maximal operator is to control the local mean oscillation of $(M^d_q f)^q$.

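Lemma 8.1. Fix $\lambda$, $0 < \lambda < 1$, and $q$, $1 < q < \infty$. Then for any function $f = \{f_i\}$, every dyadic cube $Q_0$, and every $x \in Q_0$,

\[ \omega_\lambda((M^d_q f)^q, Q_0) \le \frac{C_{n,q}}{\lambda^q} \left( \fint_{Q_0} \|f(x)\|_{\ell^q}\,dx \right)^q, \]

\[ M^{\#,d}_{\lambda, Q_0}((M^d_q f)^q)(x) \le \frac{C_{n,q}}{\lambda^q} M^d(\|f(\cdot)\|_{\ell^q})(x)^q. \]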

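Proof. The second estimate again follows from the first. To prove the first, fix $Q_0$. Then for every $x \in Q_0$ and every $i \ge 1$, we observe that

\[ M^d f_i(x) = \max\Big( M^d(f_i \chi_{Q_0})(x),\; \sup_{\substack{Q \in \Delta \\ Q_0 \subset Q}} \fint_Q |f_i(y)|\,dy \Big). \]

The second term on the right is constant; using this we define

\[ K_0 = \Big( \sum_{i=1}^{\infty} \Big( \sup_{\substack{Q \in \Delta \\ Q_0 \subset Q}} \fint_Q |f_i(y)|\,dy \Big)^q \Big)^{1/q}. \]

For $x \in Q_0$, $M^d_q f(x)^q \ge K_0^q$. We also have the following elementary inequality: for every $a, b \ge 0$, $0 \le \max(a, b) - b \le a$. Combining these facts we get that

\[ 0 \le M^d_q f(x)^q - K_0^q \le \sum_{i=1}^{\infty} M^d(f_i \chi_{Q_0})(x)^q = M^d_q(f\chi_{Q_0})(x)^q. \]

Since the vector-valued maximal operator is weak $(1,1)$ (see [14]), for any $t > 0$,

\[ |\{x \in Q_0 : |M^d_q f(x)^q - K_0^q| > t\}| \le |\{x \in Q_0 : M^d_q(f\chi_{Q_0})(x) > t^{1/q}\}| \le \frac{C_{n,q}}{t^{1/q}} \int_{Q_0} \|f(x)\|_{\ell^q}\,dx. \]

Therefore, by (3.1) with $p = 1/q$,

\[ \omega_\lambda((M^d_q f)^q, Q_0) \le \big( ((M^d_q f)^q - K_0^q)\chi_{Q_0} \big)^*(\lambda |Q_0|) \le \frac{C_{n,q}}{\lambda^q} \left( \fint_{Q_0} \|f(x)\|_{\ell^q}\,dx \right)^q. \]

□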

One weight inequalities: Proof of Theorem 1.12. As we noted above, the proof is very similar to the proof of Theorem 1.8, and so we briefly sketch the key details. By Theorem 2.2 it will suffice to prove it for the special case when $p = q + 1$. For this value of $p$ we have that $(p/q)' = p$ and $1 - p' = -1/q$. As before, fix $w \in A_p$ and $Q_N$; we will show that

\[ \left( \int_{Q_N} |M^d_q f(x)^q - m_{(M^d_q f)^q}(Q_N)|^{p/q} w(x)\,dx \right)^{q/p} \le C_{n,q} [w]_{A_p} \left( \int_{\mathbb{R}^n} \|f(x)\|_{\ell^q}^p w(x)\,dx \right)^{q/p}. \]

By Theorem 3.3 and Lemma 8.1, for every $x \in Q_N$,

\[
\begin{aligned}
|M^d_q f(x)^q - m_{(M^d_q f)^q}(Q_N)|
&\le C_{n,q} M(\|f(\cdot)\|_{\ell^q})(x)^q + C_{n,q} \sum_{j,k} \Big( \fint_{\hat{Q}_j^k} \|f(x)\|_{\ell^q}\,dx \Big)^q \chi_{Q_j^k}(x) \\
&= C_{n,q} M(\|f(\cdot)\|_{\ell^q})(x)^q + C_{n,q} F(x).
\end{aligned}
\tag{8.1}
\]

To estimate the first term we use Theorem 2.1. The estimate for $F$ uses duality: fix a non-negative function $h \in L^p(w)$ with $\|h\|_{L^p(w)} = 1$ (recall that $(p/q)' = p$). Then, proceeding as before, we use the definition of $A_p = A_{q+1}$ to show that

\[ \int_{Q_N} F(x) h(x) w(x)\,dx \le C [w]_{A_p} \int_{\mathbb{R}^n} M^d_{w^{-1/q}}(\|f(\cdot)\|_{\ell^q} w^{1/q})(x)^q\, w(x)^{-1/p}\, M^d_w h(x)\, w(x)^{1/p}\,dx. \]

Finally, we use Hölder's inequality, Lemma 2.3 and then take the supremum over all such functions $h$ to get the desired estimate.

To prove that the exponent $\max(\frac{1}{q}, \frac{1}{p-1})$ is the best possible, we consider two cases. If $p \le q + 1$, then the exponent is $\frac{1}{p-1}$, which is the same as the sharp exponent for the scalar maximal function. Therefore, the examples given by Buckley [3] immediately adapt to the vector-valued maximal operator.

If $p > q + 1$, then we can argue exactly as we did for the dyadic square function, replacing the exponent $1/2$ by $1/q$. Therefore, to show that the exponent $1/q$ is sharp we need to show that there exists a vector-valued function $f = \{f_j\}$ such that $\|M_q f\|_p \ge c\, p^{1/q} \|f\|_p$. But such a function is given by Stein [51, p. 75].

Two weight inequalities: Proof of Theorem 1.13. The proof is again nearly the same as the proof of Theorem 1.10 for the dyadic square function, so we only sketch the highlights. Fix $p$, $1 < p < \infty$; then it suffices to show that

\[ \int_{Q_N} |M^d_q f(x)^q - m_{(M^d_q f)^q}(Q_N)|^{p/q} u(x)\,dx \le C \int_{\mathbb{R}^n} \|f(x)\|_{\ell^q}^p v(x)\,dx. \]

We use (8.1). We estimate the term involving $M$ using Theorem 2.5 and the fact that $(u, v)$ satisfies (1.8) when $1 < p \le q$ or (1.10) when $p > q$.

To estimate $F$ we consider two cases. Suppose first that $1 < p \le q$; then

\[ \int_{Q_N} F(x)^{p/q} u(x)\,dx \le \sum_{j,k} \Big( \fint_{\hat{Q}_j^k} \|f(x)\|_{\ell^q}\,dx \Big)^p u(Q_j^k), \]

and this term is estimated exactly as before. Combining these two estimates we get the desired inequality.

When $p > q$, we use duality with respect to Lebesgue measure and consider a non-negative function $h$ such that $\|h\|_{L^{(p/q)'}} = 1$. Then,

\[ \int_{Q_N} F(x) u(x)^{q/p} h(x)\,dx \le C_{n,q} \sum_{j,k} \Big( \fint_{\hat{Q}_j^k} \|f(x)\|_{\ell^q}\,dx \Big)^q \fint_{Q_j^k} u(x)^{q/p} h(x)\,dx\; |E_j^k|. \]

From here we follow the argument in the proof of Theorem 1.10, replacing 2 by $q$.